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ABSTRACT 


Modern communication practices make extensive uses of image data. The 
forbidding volumes of such data make it mandatory that methods for image data 
compression be considered. 

Certain models describing non-gaussian time series have been studied in 
literature. Gibson, LiX considered the lEAR(i) process, a first order AR process in 
which the excitation is absent with a prabability p. It was shown that the structure 
of this process makes it amenable to data compression purposes, and a predictive 
variable length compression scheme for the coding of this process was designed. 
Here, a new process, lEAR-IC<i) process has been constructed, and the feasibility of 
these two processes in image source modelling and coding have been explored. A 
large number of configurations in which compression schemes based on these 
processes can exist in image coding environment have been identified and the 
compression performances of various schemes compared. Both adaptive and non- 
adaptive methods have been considered. Comparison has been made with 3-bit DPCM. 

It has finally been shown, using experimental evidence, that the scheme 
yielding the best overall compression performance turns out to be a non-adaptive 
scheme which codes the prediction error values, obtained on using a predictor 
coeffecient of value i-O' using a simple variable length coding scheme. The above 
scheme turns out to be identical to the use of the same coding scheme in the case 
of a closely related DPCM scheme. 
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Chapter 1 


Introduction 


Data compression is the conversion of a stream of analog or very high rate 
discrete data into a stream of relatively low rate data for communication over a 
digital communication link or storage into a digital memory. The need for 
parsimonious representations of data is exemplified by image data, where 
broadcast television with a resolution of 512x512 pels/frame requires a rate of 
nearly 60x10^ bit/s at 30 frames/s and 8 bit/pel intensity resolution. These large 
memory and/or channel bandwidth requirements make it mandatory to consider data 
compression methods. 

Such conversion of a relatively high rate data to lower rate data virtually 
always entails a loss of fidelity or an increase in distortion. The efficiency of a 
data compression algorithm is measured by its data compressing ability ,the 
resulting distortion as well as by its implmentation complexity. Hence the goal of a 
source coding system is to obtain the best possible fidelity for a given rate, or 
equivalently to minimize the rate for a given fidelity. The complexity of data 
compression algorithms is an important consideration in their hardware 
implementation. 


Image data compression methods find their use in a variety of environments. 
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Applications in information transmission include broadcast television, remote 
sensing via satellite, radar, sonar, teleconferencing, computer communications, 
facisimile transmission, remotely piloted vehicles; and in information storage 
include educational and business documents, medical images used in patient 
monitoring systems and radioastronomy. 

1 .1 Review of Image coding methods 

This section reviews, extremely briefly, the methodologies prevelant in 
source coding of images. 

Image coding methods can be broadly classified into two subdivisions: 

i) Point processing image coding 
2) Spatial processing image coding 

1) Point processing image coding 

The simplest method in this class is pulse code modulation, PCM. Here the 
incoming signal is sampled, usually at the Nyquist rate, and each individual sampled 
is independently quantized. For adequate amplitude resolution 6-8 bits/pel are 
necessary. For less than 6 bits/pel a contouring effect becomes visible in the low 
detail regions. These contours may be broken by adding a small amount of pseudo- 
random noise to the luminance samples prior to quantization, and subtracting the 
same noise at the receiver. Alternative methods of combating conoturing are known 
e.g. coarse fine quantization [51. 

In PCM the conversion of continuous to discrete quantitities occurs in a 
memoryless fashion. Schreiber C73 has estimated, that for image data the first 


order entropy, HCxl , is 4.4 bits/pel whereas the second order entropy, HCxlx-il , is 



only 1.9 bits/pel. By exploiting the redundancy present in images, considerable 


improvement in compression can be achieved. Predictii/e techniques achieve this by 
decorrelating the succesive samples to be quantized. Here the sampled image, X; , , 
is represented in terms of another array, e: ^ , formed by subtracting the 
predicted value of x, ; from X; ^ , which has no redundancy and from which X; .; can 
be determined uniquely. Techniques such as DPCM try to minimize the distortion 
that results in quantizing e; .; rather than x: ,• [2,53. Distortion measures to be 
minimized may be the minimum mean square error criterion or certain more complex 
functions based on subjective or psychovisual fidelity such as contrast or 
visibility. The latter result in subjectively better images but are difficult to 
implement [2,3,4,83. 

The simplest predictive scheme is a delta modulation scheme. This uses a 
one bit quantizer. The predictor simply performs an integration of the quantizer 
output signal, which is a sequence of binary pulses. An ordinary delta modulation 
scheme suffers from slope overload errors, granularity noise and instability to 
transmission errors [53. 

The DPCH scheme is relatively free of the ills that plague the delta 
modulation .The DPCM scheme is illustrated in fig. 2.1 .The predictor here takes a 
linear combination of previously reconstructed values to form the estimate of a 
particular pixel, and the error is quantized, coded (usually to 2-4 bits) and 
transmitted. The quantizer should minimize the degradations associated with 
DPCM, viz., granularity, slope overload and 
edge busyness [5,63. Predicting Sq based on 
Sj^, the first order predictor, is simply 
implemented but suffers from blurring of 
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edges, edge busyness and low tolerance to 
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transmission errors. The subjective quality of an image and the susceptiblity to 
channel errors can be improved by two-dimensional prediction. For example, a 
predictor using Sj^ and for prediction reproduces vertical edges with much less 
blurring than a predictor using just S^. Predictive schemes give an average saving 
of 4-5 bits/pel over 8 bit PCM [2-61. The limitations of DPCM, apart from its high 
tolerance to transmission errors, are its sensitvity to variation in image 
statistics, which result in non-uniform error variances over different regions of 
an image [21. 

To take into account the non-stationarities of image statistics adaptii^e 
techniques may be used [51. Here the quantizer and/or the predictor is altered on 
a decision rule based on the local image properties. Adaptive predictors provide a 
much better subjective quality, especially at the edges. Here the image is 
segmented into regions and a quantizer is chosen for each region depending on the 
activity in it. Although these schemes involve a complex implementation, adaptive 
quantization coupled with a variable length coding scheme achieves a significant 
reduction in bit rates. Predictive techniques are discussed further in chapter 2. 

Other simple spatial domain techniques include run length coding where the 
number of bits between successive changes in detail or edges is transmitted, 
together with the amplitude of the edge. To achieve bandwidth compression, the 
runs of a picture are gathered and redistributed in a transmission line sequence, 
at a rate equal to the average rate of their occurance, and transmitted. As an 
example, for an edge probability of O.i, an average rate of i bit/pel is required 
compared to 3 bits/pel for PCM. 

Facsimile coding can be done using both run length encoding and predictive 


coding of data. From the type of data to be sent, estimates of run length 
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distributions are obtained, and Huffman code words are assigned to runs of white 
or black upto a maximum decided length. Use is made of the fact that run lengths 
are highly correlated in adjacent lines. An example of a simple predictive scheme 
in facsimile transmission is to declare the predicted value of a pel as the 
majority state of the pels in the prediction window. Alternatively, a predictor may 
be designed to minimize the prediction error probability C21. 

In broadcast television large redundancies exist between successive frames. 
An effective interframe coding technique, that combats errors like aliasing, 
contouring, and flickering, is the interlace method used in broadcast TV. This 
brings down the data rate by a factor of two, but requires the storage of half a 
frame at the receiver. Since the pel values in successive frames change only in 
areas of relative motion, predictive methods can be used here, and can be simple 
to implement compared with other methods. The conditional replenishment updates 
only pel values where substantial motion is discerned [51. It has also been found 
that spatial resolution in areas of relative motion and temporal resolution in 
stationary areas may be sacrificed without loss of fidelity. This property is 
utilized to achieve further compression. In areas of relative motion temporal 
prediction is poor and spatial prediction is used. Schemes with adequate buffer 
control have been suggested [2,61. More complex schemes classify the regions in 
successsive frames as stationary, slow or fast, and appropriately update the pel 
values by using a different predictor and/or quantizer in each case. Motion 
interpolation and compensation mehods try to predict the motion trajectories of 
each pel, and then propagate each pel along this trajectory in successive frames, 
which are not transmitted, till a new reference frame is reached. Details of 
displacement estimation techniques can be found in [41. 
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2) Spatial processing image coding 

The methods in . this family include transform coding techniques, 
interpolative methods, SVD (singular value decomposition) coding and feature 
coding methods. 

In source interpolatiue schemes, the luminance values of an image are 
approximated by continous functions within some continuous error band. 
Parameters describing these functions and the range over which they span the 
pixels is transmitted. The interpolation is usually limited to zeroth and first 
order polynomials since fitting higher order polynomials, like cubic splines, 
entails complexity. In destination interpolation the image is prefiltered and sampled 
at a rate below the Nyquist rate and the interpolation at the receiver is 
performed using sine or bessel waveforms [53. 

The method of transform coding yields impressive compression ratios. The 
basic premise of an image transform coding system is that in a two dimensional 
unitary transform of an image, nearly all the information is packed into a small 
number of samples. To achieve bandwidth reduction the low magnitude transform 
coeffecients can be disregarded without introducing serious image degradation. In 
optimum predictive coding successive inputs to the quantizer are whitened, and in 
general, the optimum predictor is a non-linear causal filter. Performing a unitary 
transform on an image implies that all the samples are uncorrelated jointly and 
then quantized. The Karhunen-Loei^e transform represents the image vector in a 
space whose basis vectors are the eigen-vectors of the covariance matrix of the 
image. This transform has the property that it packs more energy into a given 
number of samples than any other transform. Because of this reason, the K-L 
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transform is best suited to transform coding but it has the drawback that the 
statistical knowledge it requires is seldom available. Also, it does not possess a 
fast algorithm. It has been shown that the cosine transform approaches the 
efficiency of the K-L transform for Markov process data, and also has a fast 
algorithm. These reasons explain the extensive popularity of the cosine transform 
in image coding. The Hadamard and Haar Transforms are utilized because of their 
effecient computational algorithms [2-51. . 

In transform coding the coeffecients to be transmitted are selected using 
either of two methods given below; 

1) Zonal coding where all the coeffecients lying in a certain decided zone 
in the two dimensional transform domain are coded and transmitted. 

2) In threshold coding the samples selected for transmission are those which 
exceed a chosen threshold. 

The number of quantization levels are usually proportional to the estimated 
variance of the component and the number of code bits to its expected probability 
of occur ance. New approaches try to adapt the quantizer to the characteristics of 
the human visual perception rather than use the m.m.s.e. criterion. In threshold 
coding the additional brunt of the locations of the samples being transmitted must 
also be borne. Spectrum extrapolation techniques may be used in conjunction with 
transform coding. Here the receiver forms the estimates of the transform 
coeffecients that are not transmitted, before taking the inverse transform. In 
general transform coding gives much better compaction than predictive coding, 
especially at low distortion levels [21 and when the image data is non stationary. It 
also has a better transmission error tolerance, since the errors are distributed 


over the entire image. But it has the disadvantage of a formidable implementation 
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complexity. 

Adapth/e tr^sform coding tries to control the quantizing and coding of 
individual coeffecients. as a function of picture content or spectral energy such 
that the resulting distortion is kept below a visibility threshold fixed by a human 
observer. Another attempt to exploit the properties of human vision for transform 
coding is the M-transforn» [43. This transform uses the basis functions of a noise- 
like structure and generates quantization error patterns which are less visible 
than those of other transforms. 

Hybrid c€3ding schemes have been proposed which possess many attributes of 
transform coding, but are simpler to implement. Here a one dimensional transform 
is taken along each of the rows of the image block. Each transform coeffecient 
sequence is then coded independently by a one dimensional predictive technique 
such as DPCM in the orthogonal direction. Consider the jth row of an image Uj and 
its transform v^. Now DPCM is employed to encode the transform coeffecients in 
the following manner: 

Vj(i) = Vj_j^(i) + ej(i) , 1 i i ^ N 

where Vj(i) is the ith compnent of the vector Vj and ej(i) is the quantizer input. 
Hybrid schemes give good performance upto rates 1 bit /pel. 

All the above schemes have the common property that . the conversion of 
continuous quantities to discrete quantities is done on scalars. A fundamental 
result of rate distortion theory, the branch of information theory that deals with 
data compression, is that better performance can always be achieved bycoding 
vectors instead of scalars, even when the data source is memoryless, l^ector 
Quantization systems map a sequence of continuous or discrete vectors into a 
digital sequence suitable for transmission. In VQ systems, the collection of all 
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possible reproduction vectors is called a codebook. Starting with an initial 
codebook, the Lloyd's algorithm, generalized now for vectors, can be iteratively 
employed on a training sequence of vectors to obtain a quantizer for vectors, 
which maps all the vectors lying in a particular region to a digital sequence. This 
quantizer is optimal in the sense that some distortion measure is minimized for a 
particular family of vectors. Analogs of predictive coding have been proposed for 
VQ systems with memory tSl. 

Certain recent advancements, called second generation coding methods CB], 
make extensive use of the advancements in the brain mechanisms of vision to 
achieve higher compressions than the methods discussed above. Typically, they 
yield compression ratios of 50 as compared to 15:1 possible using transform 
coding. These methods are likely to achieve even better compressions than that 
predicted by the entropy of popular image models. This is because firstly these 
conventional models are simplistic, and secondly because they do not take into 
account what the eye sees and how it does so. In the discussion that follows the 
model for the eye is assumed to be a LPF representing the spherical aberration, a 
logarithmic nonlinearity representing the photoreceptor response, and a HPF 
representing the lateral inhibition process in the eye CIO]. 

The local opertor based methods are developed on the synthetic hiah method 
or sub-band coding which has been known for a long time. In this scheme C5,8], the 
image is first split into low and high frequency components. The low pass image 
can be sub-sampled, in accordance with the sampling theorem, and about 5 bits/pel 
give adequate resolutions. The high frequency components can, in turn be coded 
using about 3 bits/pel because the eye can tolerate larger brightness errors in 
this frequency range because of its low sensitvity in this frequency range. In the 
pyramidal coding method the image is first low pass filtered to give and 
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the error image 
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Coding the error and the low pass filtered image, instead of the original image is 
economical, as explained above. The above process is repeated a number of times 
to give 



'i,3 “^i,j*^*^i,j " ^ 


Ic ICHf" i. 

The difference of two gaussian like LPF's, h and h , is good approximation to 

the lateral inhibtory response of the eye. Note that the information needed to 

kwHh i. k 

code X is less than that required to code x ' because of the sampling theorem. 

Good quality pictures with average compressions of iO:i are obtained using the 
pyramidal coding scheme. 

The anisotropic non-stationary predictii^e coding scheme is another hybrid 
coding scheme which yields high compactions, typically 50 ;i without undue sacrifice 
of picture quality. This makes use of the non-isotropic Wiener filter, traditionally 
used for image restoration in the presence of additive, white noise, as a 
prediction filter. The high compressions in this framework are due to the fact that 
the filter makes use of parameters related to the magnitudes and directions of 
local rectilinear features in the image and this may be viewed as the estimation of 
non-linearities within the image. Details may be found in CSl. 

The second group within the second generation methods achieves economy by 
representing images in terms of textured regions surrounded by contours, in such 
a way that the regions correspond to objects in the scene. The contours and 
textures are coded separately. Effecient codes exist for contour coding. Because 
of the low information content texture can also be coded effeciently. Contours may 


be detected in two ways; by region growing and by edge detection. In region 
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growing based methods the image is iteratively -fitered by an ini'erse gradient 
filter, which removes the granularity in an image without affecting the contours 
The mechanism of region growing is as follows: regions are first characterized by 
some property e.g. the gray level of a pel. Starting with a pixel, the succeeding 
pixels are examined to find if they have the same property as the first pel. When 
the process is over, a large number of contours are obtained that do not 
necessarily correspond to objects in the scene. The number of contours is reduced 
by eliminating those regions which contain very few pels in them, those regions in 
which the contrast difference from the surrounding areas is below a threshold 
and also removing those contours which are contained wholly inside others. The 
remaining contours are then approximated by line segments, arcs of circles etc., 
and this information is coded. After this, the general gray level shape in the 
regions described by the contours is appproximated by a two-dimensional 
polynomial function. The granularity is added in terms of pseudo-random noise and 
the polynomial coeffecients are transmitted. Compression of about 50 :i are 
attained in this way. Details of this method and others, like directional 
decomposition based coding, may be found in [81. 

. After this brief survey of the practices prevelant in image coding, the 
problem considered in this work is formulated. 

1.2 The problem 

The need often arises to reevaluate and modify traditonal methods in order to 
achieve certain ends in a better way. Over the years, the AR, MA and ARHA models 
have established their stronghold over the field of image modelling. However, in 


Ciil it is shown that these linear models are nearly gaussian. Since image data is 
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mostly non-gaussian, alternative models for non-gaussian time series need to be 
investigated. Numerous such models, directed towards representing non-gaussian 
time series have been constructed and studied by various authors [1,14-201. The 
above work extends the standard AR, MA and ARMA schemes to processes, the 
variants of which may prove useful fo image modelling. 

More important than the fact that the processes are represented by a non- 
gaussian model, is the particular structure of these processes which makes them 
amenable to more compact representation of images, than is achievable through 
conventional predictive methods like the DPCM. One such scheme which shows 
promise in image coding has been suggested by Gibson [13. This is based on the 
first order AR process in which the excitation is absent with a probability p. The 
data compression of such processes has never been investigated before. The 
objective of this study is to evaluate the performance of this scheme in terms of 
implementation feasibility, the compressions achieved and the degradation caused 
in coding various real life imagery. Attention is also paid to identifying the best 
methods among a plethora of possibilities, investigation into the use of similar 
other models suggested in literature and the extensions of this scheme to higher 
orders and dimensions. 

1 .3 Organization of the report 

The report on the study is organized as follows: 

Chapter 2 introduces the properties of the lEAR(l) process and its variants. The 
method of choosing the predictor coeffecient, the compression 
promised by this scheme and its differences with the traditional AR 
processes are explored. The DPCM and lEAR(i) coding methodologies are 
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contrasted. 

Chapter 3 considers the details involved in the implementation of the algorithm 
for image coding using the IEAR<i)/IEAR-IC(l) process and catalogues 
the various configurations in which these may be used for image 
compression purposes. 

Chapter 4 discusses the results obtained in coding real life imagery using the 
lEAR(i) process and its variants. These are compared with those 
obtained in image coding using 3-bit DPCM. The configurations of the 
algorithm which yield the best overall performance are identified. It 
finally reviews the overall conclusions reached in the study and 


points out the directions for further research. 
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Chapter 2 


The IEAR[1) Process 


This chapter discusses the lEAR(i) process within a general framework of 


predictive coding methods. 


2.1 Predictive techniques for image data compression 


In pulse code modulation (PCM), successive inputs to the quantizer are 

treated in a memoryless fashion. Often the data to be transmitted has statistical 

redundancy from one sample to the next. Consider a random sequence £xn3 in which 

R 

the the samples upto n=k-i have been transmitted somehow. Let x n denote the 
reproduced value of Xn- In predictive schemes, when Xj^ is to be transmitted, a 
quantity x^j^, is predicted from the previously transmitted values and a prediction 
error sequence 


aR 

®k= k 


( 2 . 1 . 1 ) 


is formed and quantized to give e^|^. The reproduced value of x^, is given by 


R aR Q 
^ k= ^ k+e k 


( 2 . 1 . 2 ) 


A common method using predictive quantization is DPCM. The principal components 
of a pedictive coding system are the predictor and the quantizer (fig. 2.1). 
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The error in reproduction of X|^ is given by 


V vR 

®^k“ k~ ®k‘® k 


(2.1.3) 


and is equal to the error in the quantization of Sj^. 


2 aR 

To minimize 0 e , the variance of the predicton error; x ^ should be the 


best mean square estimate of viz., the rr^ean of the aposteriori density i.e. 


R~ 

where X is the set of past reproduced values i.e. 

X^'l^= Cx^n. n<k} 

From ean.(2.i.3) the mean square distortion in X|^ is given by 
ECSx\3= 


(2.1.4) 


(2.1.5) 


From rate distortion theory [12], it is known that the quantization of independent 


gaussian samples, of variance can be done at an average bit rate 
n= |log2^ 

2 

where D is the average mean square distortion per sample and D < q C2,123. 


(2.1.6) 


Assuming ej^ to be gaussian, the quantization of can be achieved at the minimum 


achievable rate 


1 

'^DPCH= 


(2.1.7) 


If the original sequence were quantiz:ed, the minimum achievable rate for X|^ would 


1 ^if 

npcM= |1032 C-j^> 


( 2 . 1 . 8 ) 


for the same quantizing distortion c7’^Q(k> 


Hence the reduction in achievable rate of predictive quantization over DPCM is 

cr2 

^PCM''^DPCN= 

cx & 


(2.1.9) 
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Hence, the data compression ability depends upon the variance reduction by 
prediction, i.e., the ability to predict Xj^ and therefore on the intersample 
dependence of £xnl . Therefore, the underlying principle • in DPCH is to remove the 
mutual redundancy between successive samples and to quantize only the new 
information [21. 

The introduction of the quantizer inside the feedback loop presents 
analytical difficulties. In most treatments, it is convenient to consider the 
prediction process separately from the quantization procedure. 

The next section discusses some properties of the standard AR process. 
This will be followed by a brief note on the DPCM of these processes. 

2.2 Autoregressive processes 

An image is often considered as a collection of one-dimensional signals. The 
one-dimensional signal is obtained from the two-dimensional image field by a 
suitable concatenating strategy, e.g., by considering the samples to be the output 
of a raster scanner. In the AR representation, the one-dimensional signal is 
assumed to be generated by a linear system, whose impulse response is specified 
by a difference equation, and which is forced by a white noise or another random 
sequence of known spectral density function (abbreviated SDF> C2, 221. 

The pth order causal AR representation for a zero mean, stationary random 
sequence is given by 

X|c~ 2 ®n*k-n'^^k ' E££j|.l=D, E£fc|^fcj^l=l3 (2 .2.1) 

n=l 

where £|^ is a zero mean, white noise, independent of xj for 1 < k. 

For a stationary process with known autocorrelations, the coeffecients an can be 


determined by solving the well known Yule-Walker equations, using Levinson's 
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algorithm [21. Once the coeffecients an have been so determined, the quantity 
^ P 

^n^j<c_n (2.2.2) 

n=i 

is the best mean square predictor of based on the past p values .The sequence 
X|^, defined by (2=^) ^ above, is a pth order Markov process. The AR process is 
asymptotically stationary and stable, in the BIBO sense, if the roots of the, 
polynomial 


-n 


Ap(z)= 3^n2 
n=i 

lie inside the unit circle. If the process is stationary it's SDF is given by 


Sx(z)= 




Ap(z)Ap(z”^) 


, z=e ,-ir<u)<ir 


(2.2.3) 


Given an arbitrary set of positive definite covariances Cr|^} on a window 
W:{-p s k s p}, there exists a unique AR model which can generate these r^^ over W. 
An important consequence of this result is that any given SDF can be approximated 


arbitrarily closely by a finite order AR spectrum, i.e., 

2 


Lt 

Pt-40O 


^P 


Ap(z)Ap(z n 


::r=sx(z) 


(2.2,4) 


Moreover, outside this window W, there exists a unique maximum entropy 

extrapolation of the given covariance sequence, and is given by the covariances 

generated by the AR model o^ (^t8) /- 

P 

rn4.p=Sakrn+p.k ^ n^l. (2.2.5) 

k=i 

If the order of the AR process is large, Xj^ is the sum of a large number of 
random variables and hence, the density of X|^ is the convolution of the densities 
of the random variables on the right side of j It follows from the central 

limit theorem that Xj^ is gaussian as pn+eo. 


The properties of the AR process dicussed above will be useful in 
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comparison between the AR and the lEAR processes. 

2.3 DPCM of markov processes 


If the signal is modelled to be a pth order Markov process 

P 

Xk= E ^nx^-n+^k 
n=l 

the best predictor for Xj^ in the m.s. sense is 

A ■ P 

X|^= Kxj^lxj , k-p s 1 <: k-l}= E®^nXj^.n 

n=l 

Hence in DPCM the predicted value is taken to be 


^ k-2i.^n X 
n=i 

An encoder-decoder may now be designed to implement this rule. 


(2.3.1) 


(2.3.2) 


(2.3.3) 


2.4 The general linear predicton formulation 

The general problem for linear prediction in the absence of quantization 
errors is now formulated. 

Suppose the kth sample is to be predicted by taking a linear combination of 
the past p samples. Let 



^ k= '^^k-l' ^k-2' '^k-p^' 

(2.4.1) 


a'^= Caj^/ a^i iap3 

(2.4.2) 


and Xj^= predicted value. 

(2.4.3) 

The prediction 

error is 



®k ~ *^k'^k “ 

(2.4.4) 

Now, 


(2.4.5) 

and 

EC e\3 = EC x\} + A^EC A - 2EC x,^x''^,^3A 



= Rx(0) + a’^RjjA - 2F^!» 

(2.4.6) 

where is the (p-1) x (p-1) correlation matrix for C xp 
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and P'''= EC x,^x''',^}=CRx<i), Rx<2), , Rx(p-1)3 


For finding the weight vector A which yields the minimum m.s.e. we must 

•J 

have V^CECe j^}}=0; where represents the gradient operator. Solving this set of 
equations we have 


ri 


RxA = P or 


(2.4.7) 


This gives the weight vector which yields the m.m.s.e.. The value of this 


m.m.s.e. is 

EteVmin" Rx'O'-f'^^DUmum '2.4.8) 

!t can be shown that for EC6j^Xj^}=0 ,ie.j the error sequence is 

orthogonal to the past inputs. Note that these assumptions are only approximately 
true in the presence of quantization errors. Assuming the process to be ergodic, 
the autocorrelations may be estimated by finding the time averages. There are two 
popular methods for doing this, the autocorrelation method and the c 0 vB.ri 3 .ncB 
method, The equation (2.4.7) above can be written as 


aj^Rx(k-i) = Rx(i) , is i iP 

k=l 


(2.4.9) 

In terms of the estimates of the a.c.'s, to be obtained from an ergodic time series, 


this can be rewritten as 


2 ^k^^n-k^n-i~ iS^n^n-i ' iS i SP 
k=i n n 


, N-i-i 
n=0 


(2.4.10) 

The autocorrelation method assumes the data to be zero outside a window 0:^n:^N-i. 
For this range of n, we have 

-i-i 

(2.4.11) 

In the covariance method,the error is minimized over a finite interval O^n^N-l.The 


method gives the set of equations 
P 

^ a|^.(Px<k,i) = (px^OA) , Is i SP, 
k=l 

N-1 

where iPx(i^k)= ^ 
n=0 


(2.4.12) 
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, ,-711 it is also shown that 

Details of these two methods can be found in [211. In uu 

„ ..correlation or specrum 

linear prediction can be viewed as a technique for auto^^ 

approximation. (See equation (22.4) also). 


2.6 Image Modelling 

L distribution of a given 

Image models are attempts to describe the intensity 

, ,coect of the intensity 

image. Each model can only capture some important a^^* 

xi. moortant aspects of the 
distribution and no model can imbibe in itself all the 

mentation or coding, there 

data. For every basic image processing task, such as segmei 

particular model for the 

is a plethora of algorithms and each algorithm assumes a 

, -nared theoretically when 

image. The performance of different algorithms can be comp® 

.• i 5 S the validity of the 

we understand the underlying image models and the discu=> 

.= in understanding the 

assumptions made therein. These considerations help 
causes behind the success or the failure of a particular 

• ,e>n image contains four 

The methodology of choosing any model for a give' 

• I to the given application, 

steps. First, recognition of the type of model appropriate 


recognizing the inevitabilty of noise. The second step 


is a criterion for 
The third is the 


aClB * * 

estimating the model parameters using the given imaa 

, x^^.r'tion. The last step is a 

development of a decision rule for the task, e.g., edge deteuv 

pi the algorithm. 

qualitative and quantitative analaysis of the performance ot 


2.6 Alternative AR models 

p 

statistical convihience.. 

Most of the stationary time series analysis, rof" ^ 

gaussian. Marginal 

assumes the marginal distribution of the time series to 

statistical procedures 

distribution is not considered of interest per se and 
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used very definitely make normality assumptions. For ARMA processes, if the 
marginal distribution is specified to be gaussian then the innovation sequence is 
known to be gaussian, also. If some other marginal distribution is specified for the 
ARMA process the choice of the distribution of the innovation sequence is by no 
means obvious. Alternative AR, MA and ARMA were constructed [1,14-173 to model 
data which are known to be non-gaussian. For example, in 1980 Gaver and Lewis 
determined that for Cxp, having a first order AR dependence, to have an 
exponential distribution with parameter \, X(^ must satisfy 

= px(^_^-t-e|^ w.p. 1-p (2.6.1) 

where Cep is an i.i.d. sequence of exp(\) r.v.'s 

This methodolgy was extended to higher order exponential AR, MA and ARMA 
processes in [153. Here the process Cxp is exponential, the innovations sequence 
Cep is also exponential and the autocorrelations are determined by linear 
difference equations. The models are linear in both the variables and the 
parameters, but the linearity is associated with a probabilistic choice between 
several linear combinations of r.v.'s. Variants of these processes are presented in 
[16,173. 

The following disscusion concerns construction of first order AR models 

C..- 

?Dr pro^sses with a specified marginal distribution. These are precisely those 
models which find use in image coding. Higher order AR, processes are found to be 
complex to implement and without accompanying increase in compressions or 
betterment in image quality. 

Consider the first order AR process 


Xj^=: aX|^_j^+£,j^ 


( 2 . 6 . 2 ) 
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used very definitely make normality assumptions. For ARMA processes, if the 
marginal distribution is specified to be gaussian then the innovation sequence is 
known to be gaussian, also. If some other marginal distribution is specified for the 
ARMA process the choice of the distribution of the innovation sequence is by no 
means obvious. Alternative AR, MA and ARMA were constructed [1,14-173 to model 
data which are known to be non-gaussian. For example, in 1980 Gaver and Lewis 
determined that for Cxp, having a first order AR dependence, to have an 
exponential distribution with parameter \, Xj^ must satisfy 

X|j=: W.p. p 

~ '^^k-l''’®k (2.6.1) 

where Cep is an i.i.d. sequence of exp(X) r.v.’s 

This methodolgy was extended to higher order exponential AR, MA and ARMA 
processes in [153. Here the process Cxp is exponential, the innovations sequence 
Cep is also exponential and the autocorrelations are determined by linear 
difference equations. The models are linear in both the variables and the 
parameters, but the linearity is associated with a probabilistic choice between 
several linear combinations of r.v.'s. Variants of these processes are presented in 
[16,173. 

The following disscusion concerns construction of first order AR models 

f , 

I or protesses with a specified marginal distribution. These are precisely those 
models which find use in image coding. Higher order AR, processes are found to be 
complex to implement and without accompanying increase in compressions or 
betterment in image quality. 

Consider the first order AR process 

( 2 . 6 . 2 ) 


x,,= 
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where is independent of Xj for 1 ) 3 . 

This process has a dependence structure characterized by an exponentially 
decreasing autocorrelation function 


Rx(m)= ECxnXn+m5= 


(2,6.3) 


The characteristic function of the r.v. Xj^, is 
ECexpC-juxi^ll 
= ECexpC-3U)(aXj^_j^+fcj^)33 

Because is independent of we have 


(2.6.4) 




(2.6.5) 


Assuming Xc, is marginally stationary, the subscript k may be dropped and we 


<l>r (u)= 




( 2 . 6 , 6 ) 


Now we can specify the distribution of X|^ and examine the conditions on the 
distribution of which give rise to this distribution of X|^. 

If we let the distribution of X|^, to be Laplacian, with parameter \ then 




i+c^r 


(2.6.7) 


From (2.6.6) we find that 

^e= cc^+d-a^iC — i-^3 


( 2 . 6 . 6 ) 


Hence^ we conclude that takes the value 0 with probability and is a 

2 

Laplacian distributed r.v. with probability (i-a ). We can now write the difference 
equation for the generation of Cxp as 

^k= 

= ax^_. w.p. 




(2.6.9) 
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where £ep is an ii.d. sequence of Laplcian r.v.'s. 

However, it must be noted that given the output distribution, the input 
distribution can be found dirctly, using (2.6.6), but not vice versa. Also not any 
characteristic function can be substituted into (2.6.6) and yield a quantity that is 
a characteristic function. In other words, not all output distributions are possible. 
Let the output distribution be symmetric with a zero mean. i.^e wish to determine 
the conditions on the characteristic function of the output, such that it can 

be generated using the first order AR process, is a real, symmetric, positive 

definite function equalling unity at the origin. If 4>x^u>) is not a decreasing 
function for positive co, the ratio will exceed unity for a greater than 

some critical value, ac- In this case, a highly dependent series for a specified 
marginal distribution would not be possible, whereas one with a lesser correlation 
would be. Only when is a decreasing function are all values of alpha 

permitted. Now, depending on the behavior of <^x^(j))/4>x^au>) as two structures 

arise. In the first case the tails of the characterestic functions fall sufficiently 
fast so that the ratio approaches zero. The gaussian and the hyperbolic secant 
densities have this property and in these there is no zero defect, as visible in 
(2.6.9). On the other hand, if the limit is non zero, then the density of must have 
an impulse with an area equal to the limiting value at the origin. With a probability 
equal to this value, the input is zero at a given sample. This is manifested by the 
Laplacian and the exponential densities. 

As another example consider the case when the marginal distribution of x is 
specified to be uniform over Z-L |]. Then ^x£f^>= ^£<o)= if the 

numerator is zero at c<)=uq then the denominator equals zero at w= ojQ/a. For ^^(a)) 
to be bounded at this point (o=WQ/a), the numerator must also equal zero at u=UQ/a 
and in turn the denominator must be zero at u)= . This condition then becomes 
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recursive and hence must have an infinity of zeros if it has any. Also, only a 

countably infinite values of a are now permitted. In case of the uniform density 


these zeros are equally spaced, i.e., u)|^=2k'jT . Here, 

( 2 , 6 . 10 > 

For to be a valid characteristic function a must be the reciprocal of any 


asiDMZ} 

sin(au/2) 


integer greater than one in magnitude .,i .e ., 
a= i ,k=±2,±3, 


(2 .6.1 i) 


In this case is an i.i.d. sequence with each element assuming the values 
-(lkl-i)/2lki, -(lkl-3)/2lkl , , (lkl-3)/2lkl , (lkl-i)/21kl with equal probability [18- 

203. 


We will call a process represented by 


^k= ®^k-i 

w.p, p 


= ax|^.^+ ej^ 

w.p. i-p 

<2.6.12) 


as a first order intermittently excited /Vt process^ IEAR{V. The lEAR(i) 
process is completely specified by stating (i) the value of a, (2) the probability p 
and (3) the distribution of Xj^ or the distribution of ej^. 

A stationary sequence is said to be time reversible if the joint 

distributions of (Xn^^, ^ and Cx_n^j ^-n 2 , , same for all 

k and n 2 , , n^,. The sample functions of non-time reversible processes 

exhibit different statistical properties in the forward and backward directions. 
All gaussian processes are time reversible since their distributions are 
determined by the covariance function which is symmetric. Non-gaussian processes 
may or may not be time-reversible. It can be shown C203 that all non-gaussian 
processes which are modelled as the outputs of linear systems , in general, and 
the lEAR(l) process, in particular, are time irreversible. One of the consequences 
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of non time reversibility of the lEAR(i) process is that the backward mean square 
prediction error is less than the forward mean square prediction error. As ar 
example consider the case for the uniform distribution of Cxp considered above. 

Here, Xn= 

with £,n as defined above. In this case x^_jL is entirely determined by Xn, 

i.e., 

x^_^=( kxn ) mod i- 0.5 , for k even 

and ^n-1”^ kxn+0.5 ) mod i- 0.5 , for k odd, (2.6.13) 

while the error i» determining Xn from Xj^_^ is non zero, the mean square error 
equalling the mean square value of Cn Ci8-203. 

As another example consider the lEAR(i) process described by (2.6.12). For a 
first order Markov process, all distributions can be generated from the second 
order distributions. For time-reversibility we require 

Pxn^,xn2^^''=^^“ (2.6.14) 

Shifting the axis by n^+n 2 and using stationarity, we find for time- 
reversibility 

Pxn2.Xn^^^'y^ ' all ni,n2 (2.6.15) 

i.e., the second order distribution must be symmetric. Now, all second order 
distributions can be obtained from Px^ ^,Xn^X'y^- Hence if Px^ symmetric 

so are all the other second order distributions. Let us see, whether for the 
process described by (2.6.12), Px^ ^,Xn(5c,y) is symmetric or not. We have 

#’X|^(u)= ECexpC-joXj^ll 

0*"^ P E(exp[-30)aXjj,_j^33 + (1-p) E(expC-jw(aXj^_j^+ej^)33 
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or, 

tf 

3 

or, 

<te, (tj)= 

''k 

Now, 



^^k-i,^k 

or, 

^^k-i,^k' 

or, 



^k-i' 




vr 


(2.6.16) 


( 2 . 6 . 17 ) 


( 2 . 6 . 19 ) 


Substituting from (2.6.17), we have 

'^k-i 


^k-i 


(2.6.19) 


( 2 . 6 . 20 ) 


Now, 


^'Xk.i,Xk^u,v) = p#x,^_^<u+av)+(i-p)^^X(^.^<u+av)#e,^(v) 


Dxp,_^,xn^=«'y >= j J X;j<u,v)exDCjux+jvy3 du dv 


(2.6.19) 


( 2 . 6 . 21 ) 


u V 


From (2.6.19) we have 

PXj^ ^,Xn^x,y) =p J y'^X|^ ^(u+av) du dv+d-p) J J ^iu+av)^e^M du dv 

u V u V 

This integral is easily evaluated to give 

PXf.j.^,Xn'^Xiy^= P Px^_j^(x)S(y-ax) + (i-p) Px^_^<x) Pej^(y-£6x) (2.6.22) 

This is clearly asymmetric and this shows that any process described by 
(2.6.12) is time irreversible. 

The autocorrelation function for the lEAR(i) process can be easily 
determined. 

Rx(1)=ECX(^Xj^_ 23=P aE£x|^_j^X[,_j3 + (i-p) aE£Xj^_j^X|^_p + (i-p)E£e(^X|^_25 

(2.6.23) 

Noting that is independent of Xj^^j for 1 > 0 and that E{ejj.3=0, the last term 



28 


equals zero. Therefore, 

RxQ) = a Rx(l-i) (2.6.24) 

or, RxQ) = cc^ Rx(0) (2.6.25) 

Hence the autocorrelation structure of this process is the same as the 
conventional AR process. 


h- 


2.7 Coding of the lEAR(l) process 


The structure of the lEAR(i) process makes it conducive to data compression 
efforts. The equation (2.6.12) is reproduced below 


:= “^'k-l 

w.p, p 

....1 

= “^k-1+%. 

w.p. l-p 

....2 


The case ...,1, when the input is absent will be referred to as the zero error (2E) 
condition, and the case ....2 as the non zero error (NZE) condition. It is felt that, 
since the model incorporates an alternative where the error term is zero, this 
property would be made use of in simulating solid gray level areas or slowly 
changing areas better. 

A ■ g^edj ^=i ^srn3c hem Q ;Jjgstr ^ as aia£ X2r.S^145 Shi^h.-iia.Jj a:g^ predictor 
generates an estimate of the present sample using 

(2.7.1) 

If then ej^=0 and the coder communicates the occurance of the 2E condition 

to the receiver by a short binary word. The reconstructed value at the receiver 
is 


x\= (2.7.2) 

aR 

On the other hand, if Xj^ is not equal to x then the error 

aR 

®k“ ^ k 


<2.7.3) 
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is formed and quantized to a suitable number of bits. The quantized error is coded 
and transmitted , and the receiver reconstructs the sample as 

(2.7.4) 

Since the number of bits used to code the ZE condition is much lower than the 
number used to code the quantized error, a reduction in bit rate, for the same 
SNR, is achieved over conventional DPCM coding, which would assign the same 
number of bits to code the error, irrespective of the 2E condition. As the 
probability p increases the compression improves. Further, if the probability p is 
close to unity, the ZE condition would occur with a high frequency, and the coder 
could signal a succession of ZE conditions collectively using run length encoding 
methods, bringing the bit rate down further. 

Instead of considering the scheme as one for reducing the bit rate, we may 

view it one for increasing the SNR, at a given rate. Since less number of bits are 

allocated to code the ZE condition, more bits can be assigned to a larger number 

of quantization levels to which the error can be quantized. The fidelity of the 

reconstructed signal would, in this case, be much better as compared with a DPCM 

for an equal bit rate. As an example, for p=0.8 and m2£=0.5 (i.e. an average of 0.5 

bits are used to code every ZE condition) and at a rate of 2 bits/sample, 

8 

bits/sample, i.e., as many as 2 levels can be used to quantize the non-zero error, 
at a rate 2 bits/sample .DPCM would, on the other hand, would assign 2 bits for 
every sample and would consequently obtain much lower SNR's. 

2.8 bnage modelling and coding using the lEAR(l) process 

As explained above, the lEAR(i) process is completely specified by the 
values of a, p and the distributions of either Cxp or Cep. In case of image 
modelling, one is interested in the distribution of pixel intensities, i.e., the 
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distribution of ixp. The actual distributions of the pixel amplitudes is very image 

dependent and can vary from gaussian, uniform, rayleigh to various bimodal 

distributions. However, restricting the image class under consideration, such as, 

high altitude photographs of the earth or head or shoulder images, reasonable 

assumptions on the pixel amplitude distributions may be made. Another point of 

interest is that when we are trying to generate a process with certain properties, 

we do not have the complete freedom in choosing the various characteristics and 

parameters. For example, the sequence Ce^ is not Laplacian with Cx^} Laplacian, 

2 

when p is not equal to a [il. Previous experiments with the differential encoding 
of images have shown that images with widely varying distibutions of pixel 
amplitudes have a pixel amplitudes which are Laplacian distributed. Hence, it is 
difficult to decide whether to fix the distribution of txp or to fix the distribution 
of Cep in equation (2.6.12). 

The difficulties in completely specifying an lEAR(l) model for a given region 
of the image does not preclude the use of the model in image coding. This follows 
because the precise distributions of either Cep or Cxp find no use in the coding 
of the lEAR(i) process, using a differential encoding scheme, as delineated in the 
last section. In the coding of an image using an lEAR(i) process, an attempt is made 
to identify a particular region of an image, obtained after suitably processing the 
image to get a one-dimensional signal, with an lEAR(i) process. This is achieved by 
forming the sequence' 

and assuming p, the probability of the ZE condition, to be ^ 0.5. Then from (2.6.12), 
we have 

yk= 5$];= « W.P. P 

= a + rr-!^ w.p. 1-p (2.8.2) 

*k-l 
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Hence, the sequence y|^ is examined, for each region of the derived i-d 
signal, and a recurrence of equal values of implies that this particular region 
can be identified with an lEAR(l) process with prediction coeffecient a. The value 
of p and the distribution of Ce^ are left unspecified, and may be assumed to be 
either changing over the given region or as functions too complicated to be 
described exactly. The coding is achieved by the method outlined in section 2.7. 
Over another region a different lEAR(i) model may be valid. In case the recurrence 
of equal values of y|^ does not happen, over a particular region, this region cannot 
be taken to be the sample function of any lEAR(i) process. It is shown in sec. 3.1 
that to achieve economy, over such regions it may be economical to revert to DPCM 
for coding such regions. Note that the assumption of ergodicity is inherent in the 
above approach since the model parameters are being estimated from the sample 
functions themselves. 

The above approach tries to isolates regions of stationarity in a* one 
dimensional signal derived from the image, and each such region is the sample 
function of a different IE ARC 1) process. This approach is essentially adaptive, 
since the predictor is altered as the statistics of the input process change. A 
simpler approach would be a non-adaptive one, where a value of a is decided a 
priori, depending upon some assumptions about the image statistics, and this value 
of the predictor coeffecient is used for the entire image. Here the entire image is 
being considered as the realization of a single lEARCl) process, or a concatenation 
of lEARCi) processes, with the same value of a but with the probability p and the 
distribution of e|^ that change over the image. The performance of the two schemes, 
the adaptive and the non-adaptive are compared later. 



32 


2.9 Differences of the IEAR{1) process and the inbuilt coding methodology with 
the conventional AR processes and the DPCM 

Consider a first order predictor for DPCM. The quantization errors are 
considered negligible. The prediction error is given by 


X|^ = a' (2.9.1) 



Now consider the lEAR(i) process 


X|j= aX|^_j[ w.p. p 

= e|^ w.p. i-p 

The mean square error is given by 

ECe\3= p.O + (l-p).E«X(^- <2 9.2) 

And the value of a that minimizes the m.s.e., is again 

<>^opt= <2-9-3) 

Suppose an lEAR(i) process is generated with a known a. (and some value of p and 
some distibution of Cej^} ). For this process, the autocorrelation function is known 
and we must have Rx(l)=a Rx(0). But a- gives the slope of the 

line with the best m.s. fit in the X|^-X|^_j^ plane. Also this line must pass through at 
least half the points in the Xj^-X(^_j: plane, for p ^ 0.5. Hence we see that for an 
lEAR(i) process with known autocorrelations, the best m.s. line passes exactly 
through at least some of the points in the X|^-Xj^_j^ plane. In this case the coding 
process would inherit all the properties of DPCM, such as orthogonality. 
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When we fit a model to a given time series, the underlying distributions of 
which are unknown, the situation is different from the one above. Here, two lines 
can be fit to the data (asssuming an lEAR(i) representation of the process can be 
found). The first is a line passing through some of the points in the plane, 

the slope of which gives the value of the predictor coeffeoient, a, for the coding 


procedure corresponding to the associated lEAR(i) process. The other is 


a line providing the best m.s. fit in the X|^- 
plane, associated with DPCM. The 
prediction coeffecient for DPCM is 


determined frorri ^ — 

2* i 


where the 



range for i may be chosen in accordance with one of the the well known 
autocorrelaton or the covariance methods. In case of the lEARd) process 
the predictor coeffecient is determined from the recurrence of equal values of 
Because the line corresponding with the lEAR(i) fit is, in general, 
different from the line of the best m.s. fit, the lEAR(i) process would result in a 
larger m.s. prediction error but affords greater compression since the occurance 
of the ZE conditon facilitates economy. Also because the prediction coeffecients 
for the two cases are different, for the coding using the lEAR(i) model, 
orthogonality would not hold, and consequently the complete decorrelation of the 
error is not achieved, resulting in larger error variances. It is also clear that 
whereas it is always possible to fit an AR(i) model to a time series it is not 
always possible to do so in the case of an lEAR(i) model. 
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2.10 Variants of the lEAR(l) model 

In the above discussion the implicit assumption has been ial<i. If the 

Iv* 

filter X|^ = a is unstable in the BIBO sense and also, since Rx(k)=a Rx(0) 

this reperesents an unrealistic autocorrelation situation in that the a.c.'s are 
increasing with the time index, k. But the case of tal^i is necessary in this 
development in order to model increasing or constant gray level regions in an 
image. Also since any image is necessarily of finite dimensions, here we will not 
bothered by conditions of BIBO stability. Hence we make no distinctions between 
the cases !al<i and tctl^i. This is another way of saying that, in the case of 
these algorithms, a no longer has the connotation of a first order autocorrelation. 

The IE ARC i) process taken up above enables one to model regions of either 
just increasing or just decreasing or constant gray level values. To accomodate 
regions in which gray levels both increase as well as decrease, another process 
may be constructed, the IE>4R-IC^lj process defined by 


:= «=fk-i 

w.p. 


= axi^_^+ 

w.p. P2 


~ a^k-i 

H.p. P3 


= S^k-i+e'k 

W.P. P4 

(2.i0.i) 


4 

with ^Oj=i and lal<i. Cej} and Ce';} are independent seauence of r.v's. 

1 

The abbreviation IE>CR-IC<'1^ then, stands 'for the first order intermittently 
excited process u'ith ini/erse coeffecient. Note that the prediction 

ooeffecients for the lEAR-IC(l) process can be specified by just the value of a and 
this is important from the point of view of parsimony. In this case, for a chosen 
region of the i-d signal derived from a given image, the sequence y'|^ with 
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If 5r-!^>i (2.10.2) 

*k-i 

is formed and the value of a decided depending upon the recurrence of 
equal values of y'j^. The predictor first forms the estimate 

x\ = (2.10.3) 

aR 

If Xj^=x the coder signals this condition to the receiver. Otherwise, the 
predictor forms the estimate 

' iA-i C IO-*’ 

Again if X|^=x^j^ then this conditioned is signalled. If Xj^j.<>x^l^ again, then the 
error e|^= X|^-x^l^ is formed, quantized, coded and transmitted. Note that in this 
case one additional bit is required, over the lEAR(i) process, to signal the ZE 

condition which may arise in two possible ways. But the frequency of the ZE 

condition in this case is expected to be higher than that in the case of the lEAR(l) 
process . 

The autocorrelation function of the lEAR-IC(l) process is 

Rx(l)=i0^RxO , /5= a + (|-a)(P3+p4) (2.10.5) 

where P2=l-PjL-P3-P4 been used in the derivation. 

In general, the coding process for the lEAR-IC(l) model would result in larger error 
variances than DPCM and does not decorrelale the error fully. 


2.11 Higher order and multidimensional processes 

Consider the second order process 


^k=®l^k-l 

w.p. 

Pi 


w.p. 

Pz 



=“3^k-l P 3 

=i3xj^_j^ +7xj|,_2 +®|(; w.p. 
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(Z.ii.i) 

The assumptions on Csj^} and p^'s are similar to those made in the previous 
sections. The considerations in the construction of this process are different 
from those in CiSl. Here we must remember that this model will find its use in 
image coding and we are not specifically interested in a solving a set of equations 
from which the distribution for the excitation, for a specified marginal 
distribution of the output, can be determined easily. A coding strategy based on 
the lEAR processes suffers from an increasing overhead, which comes into play, 
apart from that involved in quantizing and coding the various coeffecients, as one 
of an increasing number of possible 2E conditions must be signalled to the 
receiver, as the order of the lEAR process increases. For example, in the above 
case, two additional bits will be required to code which of the three possible 2E 
conditions has actually occured, unless a decision is made to code a succession of 
ZE conditions, occuring from the same alternative, by run length coding. Moreover, 
since the philosophy behind choosing the coeffecients is essentially different 
from that of DPCM, in that no minimization of the m.s. error is performed, these 
schemes can make no more claims than being ad hoc techniques to achieve data 
compression, as opposed to the theoretical soundness of DPCM, which enables it to 
establish more satisfying tasks like autocorrelation or spectrum approximation. In 
case of DPCM it can be shown that the error power decreases as the order of the 
prediction increases . Because the coeffecients of lEAR processes are chosen 
under a different rationale, no analogous statement can be made for the coding 
schemes based on these models. 

Exactly the same remarks as above can be made for two-dimensional lEAR 


processes by recognizing that equation (Z.ii.i) can be converted into a 2-d 
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prediction scheme simply by replacing by x^ j, by j and x^_2 by 

Hence in this work attention is restricted to the compression performance of the 
lEAR(i) and the lEAR-IC(i) processes in image coding. 



Chapter 3 


Aspects of impiementation 


In this chapter we take up the tradeoffs and the heuristics involved in a 
software implementation of an algorithm for image coding based on the lEAR(i) and 
the lEAR-IC(i) processes. Two methodologies are followed, as explained in sec. 2.8. 
The first an adaptive one in which the regions within an image are isolated into 
sample functions of different lEAR/IEAR-IC processes, each characterized by its 
own predictor coeffecient a; and the second in which the entire image is 
considered as the realization of a single lEAR/IEAR-IC process or different 
lEAR/IEAR-IC processes with the same value of a. Comparison is made with 3 
bit/sample DPCM. The implementation of the adaptive scheme is considered first, 
and that of the other would follow immediately. 


Recall that the lEARCi) process is described by 


^k= “^'k-i 

w.p. p 

= ax,,.i+e,. 

w.p. i-p 

and the lEAR-lC(i) process by 


w.p. 

= «^k-l+®k 

w.p. P 2 




W.p. P3 


- a^k-1 

= ^X|^_l+e'|^ w.p, P 4 . ( 2 . 10 . 1 ) 

3-1 The coding protocol 

The actual protocol in signalling is taken up first since this would clarify 
some points in the earlier stages of trying to decide the value of the prediction 
coeffecient for a particular region. For the lEAR(l) process one bit is used to 
distinguish between the 2E and the N2E conditions, i.e., if the ZE condition occurs 
a zero is transmitted; else a one followed by a 3 bit codeword for the quantized 
error is sent. Thus a run length coding scheme is not used. This aspect is 
discussed further in sec .3 . 6 . For the lEAR-IC(l) process again one bit is used to 
discriminate between the ZE and the NZE conditions. If the ZE condition occurs 
then another bit is used to indicate which of the two possible cases have caused 
the ZE condition. Note that if a=i.O then the lEAR-IC(i) process is the same as the 
lEAR(l) process, and the coding scheme for this case is the same as that for 
lEAR(l) process. 

Another aspect of the coding protocol needs to be taken up. In general, it 
may not be possible to fit an lEAR process to any given region of the one 
dimensional signal derived from the image. In such a region, since the ZE condition 
does not occur often enough no saving, due to the shorter bit word assignment to 
the ZE condition, is to be expected. Since the NZE conditions require 4 bits/sample 
to signal information about the quantized error which would ordinarily take 3 bits, 
it is better to revert to DPCM while coding such regions. Hence over any length of 
the signal, we have regions which have been coded following the lEARCi) philosophy 
alternating with regions coded using DPCM. This switch from lEAR(i) encoding to 



DPCM and vice-versa has to be indicated to the receiver by a word of length equal 
to that used to encode the the NZE condition^ viz., 3 bits to signal the switch from 
DPCM to IEAR(i)/IEAR-IC(i) and 4 bits to indicate the opposite case. 

With this foresight we are ready to go through the various steps in the 
coding algorithm, systematically. 

3.2 Converting the image into a 1-d signal 

The conversion of the image into a one-dimensional signal may be done in 
two ways: 

(1) treating the rows of the image individually and ignoring the correlations 
in the orthogonal direction. This is the practice in broadcast TV, where the image 
is scanned by means of a raster scanner. 

(2) dividing the image into blocks of size 4x4 or 8x8 and considering the 1-d 
signal to be a concatenation of the rows of the individual blocks. This method 
makes use of the image correlations in both the directions, especially for the 
smaller 4x4 blocks. 

It must also be remembered that the algorithm uses a buffer equal to the 
length of the i-d block and a large image size may make it mandatory to choose 
smaller i-d blocks. 

3.3 Isolating the IEAR(1)/1EAR-IC(1) regions and the determination of the predictor 
coeffecients 

Once the i-d block containing the PCM values is acquired, the process of 
isolating the lEAR/IEAR-lC regions begins, and the entire procedure is repeated for 


each i-d block. If regions of validity of the lEAR(i) process are to be isolated 



then the sequence yj^, defined in sec. 2.8, is constructed, else, when the coding is 
to be , accomolished using the lEAR-IC(i) orocess the sequence y'j^, defined in sec. 
2.i0, is •formed. The process of recognizing the regions of the derived i-d signal 
as sample functions of IEAR(i)/IEAR-lC(i) processes now proceeds in an identical 
fashion for either case. Hence, in the following discussion any reference to the 
sequence Wj^ will imply both y|^ as well as y'j^ , and will be valid for the IEAR<i) fit 
and the lEAR-IC(i) fit, respectively. 

Now the coder looks for recurrence of equal values of yj^. Here, it must be 
allowed for the fact that any model that we are trying to fit to the data, will be 
only approximately true. Therefore, exact equivalence of yj^ values is not expected; 
rather in a region where an incompletely specified lEAR(i) model is approximately 
valid, the sequence y|^ will take values that will lie in some neighbourhood of the 
model parameter a. This neighbourhood of a, in a. region of the image which, at 
best, closely conforms to the sample function of an lEAR(i) process with known a, 
can be attained in two possible ways: 

The sample values in the region may satisfy the relationship 

Xj^= aX|^_^+e w.p. p crit. i 

or they may satisfy 

Xj^= (a ± w^p. p crit. 2 

where a defines the neighbourhood of a. 

These two criteria for identifying lEAR(i) regions in an image will, in 
general, identify disjoint regions of the image as sample functions of IE ARC i) 
processes, i.e. one will not associate the same region of the image with an IE ARC i) 
process as the other. For example, in regions where the gray level values are low. 


the ratios of adjacent pixel values fluctuate over a wide range, even when the 
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gray level values themselves do not change so much. In this case, for the same e 
crit. 2 will not declare this particular region as the realization o-f an lEAR(i) 
process whereas crit. i will. Conversely, in regions of large x^’s, the ratios 
ususally remain close to one (in the absence of edges), though the pixel values 
themselves may undergo a large variation. Here, for small e, crit. i may not be able 
to identify such a region of the image with an appropriate lEAR(i) process 
whereas the same is not true for crit. 2. Exactly similar arguments are valid for 
the lEAR-IC(i) process with p replaced by Pj^+P 3 and noting that x^. and X|^_£ can be 
interchanged freely there. In chapter 4 performances of these two criteria in 
image coding are compared. 

In the arguments that follow, 'equivalence' of yj^, values would imply y|^ values 
being equal in an a-neighbourhood in the sense defined above by the two criteria i 
or 2. The probability p of the ZE condition is assumed to be ^ 0.5. This would 
imply that at least every alternate y-value, in a region of the image that can be 
modelled as a sample function of an lEAR(l) process, is equal to a. However, the 
algorithm, in order as not to hastily reject a region, which can yield a saving 
through coding using an IE ARC i) process, allows one instance of two y-values, equal 
in an e-neighbourhood and differing in two places, to be included in deciding if a 
particular region is to be coded by the lEAR(i) method. Also, the value of yj^ when 
both X[^ and X|^_j^ are zero is taken to be one, and the values when and X(^<>0, 

are assigned the value -i.O and are not considered while resolving the value of 
the prediction coeffecient for encoding a particular region of the image. 

The method described below uses crit. 2 above in isolating the realizations 
of different IE ARC i) processes within an image. The straightforward modifications 
in the method for crit . i are described later . 
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The method proceeds as follows: the y-array, derived from the 1 -d bloc 
obtained from the image, is scanned by a window, W=£wp i=i to 4 ), which spans foui 
successive y-values. W, in the beginning, is stationed at one end of the y-arrai; 
i.e., ■for' i=i to 4 . The a-value is taken to be yj_. If yj^=-i.O then W shifts on« 

place to the right and the next y-value, viz. y2> is taken to be the cs-value. This is 
repeated whenever any a-value being considered ' becomes -i.Q. The values of W2) 
and W4 are next compared with a and W shifts with Wj^ at the first w-value which is 
in an epsilon neighbourhood of a. After this restationing of W, W2 j W3 and w^ are 
again compared with a and W shifted appropriately; simultaneously, a counter 
keeping the count of the occur ance of the ZE conditions is updated. Note that once 
W4 is found equal to a, the length of the window is reduced by one, i.e., only the 
values of W2 and W3 are used in determining the equivalence of the W values with 
a, as explained above. This process is repeated till either the end of the i-d block 
is reached or none of W2^ W3 or w^, are found to be in an t-nbd. of a. Now, before 
deciding if this block is to be coded using an lEAR(i) process, it is seen if a 
sufficient number of ZE conditions, t, have occured, over the block length. The 
sufficiency conditions are examined in the next section, and a large enough t would 
indicate that this block would achieve better economy if coded using an lEAR(i) 
process than when it is coded using DPCM. If an enough number of ZE conditions 
are detected, then the starting and the end points of this block together with the 
value of the predictor coeffecient are stored. Alternatively, if a sufficient 
number of ZE conditions are not discerned, W moves back to so that Wj^ is is 
positioned at one place right of the y-value which was taken as a. The value in Wj^ 
is taken as a and the possibility of using this a-value as the predicton 
coeffecient for coding the following region is then explored, as done previously. If 
a succession of these a-values is found to yield an unsatisfactory freqency of ZE 
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conditions, this region cannot be coded using the lEAR(i) process without causing a 
deterioration in the compression performance, and is coded using the DPCM 
methodology . 

Sometimes, it may be possible to identify a particular region of an image 
with more than one lEARd) process. In a more common situation, the end of the 
region of validity of one lEAR(l) process may overlap with the beginning of the 
neighbouring one. By making a judicious choice as to the range over which a 
particular lEAR(i) process is to be used in coding, better compression than 
otherwise can be achieved. Hence, after it has been decided to encode a 
particular region using an lEAR(i) process, as described in the previous paragraph, 
W is moved back and is stationed with w^ at the first value in the region not in an 
e -neighbourhood of a. This value in Wj_ is next explored as a prospective value for 
a new prediction coeffecient, as outlined previously. If a large enough number of 
ZE conditions is not found, starting with this new seed, W is shifted with w^ at the 
next y-value not in an t-nbd. of the a chosen in the region previously. A sufficient 
number of ZE conditions starting from a seed a, may ultimately either lead to a 
new lEAR(i) process the region of validity of which is contained entirely within 
that of a previous lEARd) process, or may be partially overlap the region of the 
previous IEAR<i) process or may be disjoint from it. The entire process, explained 
above, is repeated till the end of the derived 1-d block is reached and at the end, 
it yields the starting and the end locations, and also the values of the various 
associated prediction coeffecients, for the lEARd) processes to be used in coding 
the 1-d block. The procedure to sort out these overlapping regions before coding 
is outlined in the sec. 3.5. 

The arguments for the use of crit. 1 instead of crit. 2 follow in an 


analogous fashion. Here the window W scans not the y-array but the i-d block 
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derived from the original image. An a-value is first calculated as the ratio of two 
adjacent x-values. After this, the potential of this a as a predicton coeffecient in 
the region is ascertained, as before. The products of W 2 and Wg with a are 
next formed and W is shifted with at the position of the first if the 

product of a and Wj^ lies in an a-nbd. of and simultaneously a counter 

updated. The process is repeated till either the end of the i-d bock is reached or 
none of the three products mentioned satisfy the crit. i. In the actual 
implementation two windows Wj_ and W 2 are used, W 2 moving over x-array as before, 
and Wj^ moving over the y-array and furnishing the a-values to W 2 for determining 
the occur ance of a 2E condition through the use of criterion (2). The two windows 
scan identical locations in the y and x arrays, respectively and each undergoes 
the same shifts as the other. The sufficiency conditions for this case remain the 
same as for crit. 2. At the end of the i-d block the procedure yields the starting 
and the end locations of the lEAR/IEAR-IC regions, in general overlapping, and 
these must be sorted out before the actual predicction/coding process. 

The above procedures make the algorithm one of a multipass nature, since it 
repeatedly scans a given length of the derived i-d block. The scheme also requires 
a buffer of the length of the i-d block, and the maximum admissible length of a 
potential lEAR/IEAR-IC region is also limited by this length. An example will help to 
clarify the above procedure: 

EXSimpte" Consider the case when we are trying to isolate the sample functions of 
an lEAR(i) process. Let us suppose that t, the number of occur ances of a ZE 
condition, must be at least 3, before a region can be identified as the realization 


of any lEAR(i) process. For the purpose of this example take fc=0. 
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The sequence y|^ has been obtained as: 

y,^ 0.7 0.9 0.2 0.9 1.0 i.O 0.9 1.0 0.5 0.9 0.3 

position 12 3 456789 10 11 

The procedure sequences as follows: 
t=l; 

STEF 1: Wj^ at 1 ; 1 -^ 2 = ^“^ 3 ' <> Wj_; shift Wj^ to 2. 

STEP 2: w^ at 2; W 3 =Wj^; shift to 4; t=2. 

STEP 3: Wj^ at 4; W 4 =Wj_, shift to 7; t=3. 

STEF 4: at 7, W 4 =Wj_; w^swj; already occured in STEP 3; shift Wj_ to 3; Extent of 

the identified lEAR(i) region is from position 2 to position 7 , reset t to 1. 

STEP 5: Wj^ at 3; W 2 . wg, w^ <> Wj^; shift Wj_ to 5 (because the yj^. value at position 4 
already taken in the previous lEARd) process). 

STEP 6 : Wj^ at 5j W 2 =Wj^; shift Wj^ to 6 ; t=2. 

STEP 7: W£ at 6 ; W 3 =Wj^; shift Wj^ to 8 ; t=:3. 

STEP 8 : at 8 ; Wg, Wg, W 4 <> Wj^, STOP; Extent of the identified lEAR(i) region is 

from position 5 to position 8 . 

Thus, the algorithm in the example results in overlapping lEAR(i) regions, 
which must be sorted out before coding can be done. 

3.4 The criteria for the sufficiency of t 

The criterion to determine if a sufficient number of ZE conditions have 
occured ensures that coding a particular region through an intervening lEARCi) or 
an lEAR-IC(i) process would yield a better compression than DPCM. Say that the 
length of the IE ARC 1) region under consideration is 1, i.e. U1 has scanned 1 pixels 
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before either the end of the i-d block is encountered, or none of W 2 » Wg or is 
found equal to a. Over this region say t probable ZE conditions have been 
discerned. It would take 31 bits to code this region using 3 bit/pel DPCM. If the 
lEAR(i) process is used to code this region the information would take 
t+4(l-t)+3+4+i bits. 

This number is arrived at by considering the coding protocol and the overheads as 
done in sec .3.1. 3 bits are used to indicate, the switch from DPCM to the lEAR(i) 
coding method, 4 bits to indicate the reverse switch, and 1 bit is used to indicate 
to the receiver if the region following the lEAR(i) region is coded using an lEARd) 
scheme or a DPCM scheme. If the lEAR(i) method is to secure an economy over DPCM 
then we must have 

t+4a-t)+3+4+ii 31 

or, t>i+2 

However, if the lEAR(i) region is situated at the end of the i-d block the last 5 
bits required to indicate the switch from the lEAR(l) to the DPCM will not be 
needed in signalling, and the above inequality, in this case, becomes 

tii*i 

In the case of the lEAR-IC(i) process, the signalling protocol would demand the use 
of 

2t+4(l-t)+3+4+i bits 

to code a length 1 with t ZE conditions over it. To achieve economy over DPCM one 
must have 

t>|+4 

when the lEAR-IC(i) region is not situated at one end of the i-d block, and 

t^+2 

when it is It must be remembered that when the case of a=i.O arises during the 
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coding using an lEAR-IC(l) process the coding process reverts to that of the 
lEAR(i) process and the former two bounds are valid for this case. An idea to the 
stringency of the above inequalities can be acquired from the tables below which 
show the bounds used in the implemented algorithm, represents the 

requirement on the minimum number of 2E conditions in a length 1 for the 
IEAR(i)/IEAR-IC(i) coding to perform better than DPCM. 


1 

^min 

1 

^min 

4 

4 

9 

9 

5 

5 

iO 

10 

6 

5 

il 

10 

7 

5 

12 

11 

9 

6 

13 

12 

iO 

6 

14 

12 

16 

8 

16 

13 

tmin=^l 3)+3 

tmin=^l 

div 2)-(-5 

for lEAR(i) process 

for lEAR-IC(l) process 


It must be mentioned here that all the above inequalities actually give lower 
bounds for t, since the additional overheads in quantizing and coding the 
prediction coeffecients, a, are not considered in determining these bounds. The 
reason for this is explained in sec .3 .6. 

The problem with the above scheme is that, although it promises the best 
way of allocating the lEAR/IEAR-IC regions, all the 2E conditions predicted do not i 
occur when the quantization errors are taken into consideration. This can be: 
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illustrated taking the example of an abrupt edge, where the gray level values jump 
from the minimum to the maximum. The algorithm 
assigns the entire range 1-2 as the sample function of 
an IE ARC!) process with a=i.O, and predicts that the 
NZE condition would occur just once. However, the 
quantization of the error values would bring about the slope overload error, and 
consequently it would take a large number of samples before the error would be 
close to zero. Under these circumstances the number of ZE conditions would be 
much greater than the predicted value, viz. one. 

This loophole in the above scheme paves the way for an alternative method 
which may perform slightly better than the former scheme. Here, the sufficiency of 
the ZE conditions is not determined through an inequality as before. Rather, it is 
determined through the number of ZE conditions exceeding a particular number, n, 
determined experimentally and chosen independently of the length of the region of 
validity of the IEAR(i)/IEAR-IC(i) process. As n is increased from a small value, the 
total number of bytes needed to code an image goes through a minimum. This is 
expected, since when n is small, the number of bits needed to code a particular 
image by representing it as a succession of lEAR/IEAR-IC regions interspersed with 
regions to be coded by DPCM, will be larger than if the entire image were to be 
coded using DPCM alone, because the overall saving occuring from coding the 
lEAR(i) regions will not be able to offset the extensive overheads caused in coding 
the regions with a small frequency of ZE conditions. At the other end, if n is kept 
too large, then again the bit rate increases, for the regions which could have 
yielded a saving are now left out because the decision of coding a region using 
the lEAR/IEAR-lC method now requires an overly high frequency of the occur ance 
of ZE conditions. Thus there exists an optimum value of n, which’ yields the 
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best compactions. This minimum is found to be relatively broad so that the values 
of n adjacent to i.e. nppj.+i, approximatey; yield almost the same bit rates, for 

a given SNR. The value of n^^p^^ remains nearly constant for various real life 
imagery. This value of Opp^^ is determined experimentally for each particular 
configuration of the algorithm, and then this n^p^^ is used to decide whether a 
suffucient number of ZE conditions have occured over a region. 

3.6 Sorting the overlapping !EAR(1)\EAR-IC(1) regions 

The overlapping regions, obtained after the above process is over, must be 
sorted out. The procedure outlined below chooses the combination of lEAR and 
DPCM regions which would yield the best possible savings. It disregards the 
interference caused due to the quantization errors. 

Let (a^,b£,aj^) and (a2,b2,a2^ be the starting point, the end point and the 
values of the prediction coeffecient respectively for two regions to be coded 
using different lEAR models. The following three cases arise here: 

Case i: If a2>b£ then the two regions are disjoint and are coded in turn by their 
respective a's. 

Case 2 : If a2<bj_ and b2:^b£ then the second region is wholly contained within the 
first, and the region <a^,bj_) is coded using as the predictor coeffecient without 
any further attention being paid to the region contained inside the first. 

Case 3 : If a2<bj^ and b2>bj^ then the regions overlap. By a method similar to that 
above, it is determined if the coding of the regions and Cb^,b2»a2j using 

their respective lEAR processes secures an economy over DPCM. If both these 
regions yield a saving then if the saving in (a^,a2)>saving in (bj_,b2l, then (aj|^,a2) 
is coded using and (a2)b2l is coded using qi,2> otherwise (aj_,bj^) is coded using 
and (bj|^,b2) is coded using a,2- I-f (a^,a2,aj_) yields a saving over DPCM and 



5i 


does not then (aj^,a 2 ) is coded using aj,, and (a 2 <b 2 ) is coded using 0 , 2 , and vice- 
versa. If adequate saving is not achieved in either (.^^> 3 . 2 ) or in (bj^,b 2 ) then out of 
the two "alternatives, coding (aj^.b^) using and coding (a 2 ,b 2 ) using a 2 > the one 
yielding a greater reduction in the number of bytes is taken and the other option 
discarded. 

All the (aj^,bj,aj^) values for a i-d block are paired and the procedure 
outlined above repeated for each pair. At the end it yields the sequence of non- 
ovar lapping IEAR(i)/IEAR-IC(i) regions which can code the given 1-d block in the 
most economical way. 

3.6 Differential coding of the image using the iEAR(l)/IEAR-IC{l) processes 

The non-overlapping regions obtained as a result of the above procedures 
can be coded as explained previously: the lEAR(i) regions by the scheme in sec. 2.7 
and the lEAR-IC(i) regions in sec. 2.10. The intervening regions are coded by 
straightforward DPCM. The value of the predicition coeffecient in case of DPCM is 
taken to be i.O. The reason for this is the fact that horizontal correlations for 
most of natural imagery are estimated to be around 0.96 C53. Hence an elegant and 
not too wasteful approximation is to choose the a for DPCM coding as i.O. 

The estimate for the first pixel in an image, x'^q q is taken to be the 
maximum value of the PCM quantization level in the image (e.g. for a 6 bit/pel 
image x’^qq= 63). For the successive blocks, the estimate for the first sample in 
the block is taken to be the reconstructed value of the pixel adjacent to it, i.e., 

aR r 

when the i-d blocks are obtained from the image then x =x ; otherwise 

when the 1-d blocks are obtained from the image on concatenating the rows of the 
2-d blocks (4x4 or 8x8), into which the image is sub-divided, the estimate for the 

• CC'iTR ! '■^RARY ; 

I f T.. i 
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first pixel of each i-d block is taken to be the reconstructed value of the pixel 
adjacent to it in one of the neighbouring blocks. 

Now we come to the problem of the two quantizations involved in the 
encoding procedure: the quantization of the prediction coeffecients, a, and the 
quantization of the error, The quantization of the a's is taken up first. 

In the absence of quantization noise in the prediction errors, it is possible 

I 

to do away with the quantization of the a's. Let x^ be the first sample value in a 
region which is to be coded using the lEAR coding scheme. Let Xq be the pixel that 
comes immediately before If the actual error values could be sent, with no 
quantization noise, then the predicted value for x^, x^^, could be taken as Xq, and 
the error between x^ and Xj_ transmitted to the receiver. The receiver would now 
evalauate a by finding the ratio Xj^/xg. This a-value would now be used by both the 
transmitter as well as the receiver as a prediction coeffecient in forming further- 
estimates Hence in this scheme, the estimate for the first sample in an lEAR 
region is formed taking a as 1.0. Here, the a-value valid for the entire lEAR 
region can be evaluated from two initial values, and the brunt of transmitting 
these a's car be avoided. 

If the quantizing noise is present, then the receiver forming the estimate of 
a from the ratio does not necessarily yield a value in the neighbourhood 

of the real a. An error in the value of a reflects as a sharp fall in the frequency 
of the 2E conditions, and the accompanying degradation in the compression 
performance. In such a case, the value of a must be quantized, coded and 
transmitted to the receiver. For the lEAR-IC(l) process the value of a lies in the 
range (0,11. Here a first approach is to quantize this entire range. In the 
implementations used, a 5-bit quantizer is employed for this purpose. Improvements 
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in this Quantization scheme stem from the realization that the most common a 
values are those that lie close to i.O. Here, the values of a, lying outside a small 
range around 'i.O are not considered while assigning the values of the prediction 
coeffecients for various lEAR regions, by the method in sec .3 .3, while thea values 
lying inside this range are quantized and coded. If the coding is to be done 
through the lEAR-IC(i) process, lai:^1.0 and the quantization intervals for a are as 
shown below 


80 83 

84 85 

86 88 

89 91 

92 94 

95 96 

97 98 

99 100 

xiO'^ 

81 

83 

87 

90 

93 

96 

98 

100 

xl0“^ 


In lEAR(i) coding, the a values can be greater than i.O and the following 
Quantization scheme is used 


86 88 

89 91 

92 94 

95 96 

97 98 

99 101 

102 104 

105 106 

1 

o 

X 

87 

90 

93 

96 

98 

100 

103 

105 

xlO'^ 


In the two tabulations above, the values lying in the first row in the range 
indicated are quantized to the values in the the second row. 

Now the details of the quantization of the error are discussed. It has been 
shown experimentally that images with widely varying distributions of pixel values 
have a prediction error which is Laplacian. Hence, the prediction error is assumed 
to have a Laplacian density, and consequently a non-uniform quantizer , such as the 
Lloyd-Max quantizer, would perform better than a uniform quantizer . in the 
experiments on the natural images done, a Lloyd-Max quantizer is used for the 
purpose of error quantization. The decision and the reconstruction levels for a 3 
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bitXpel Quantizer for a signal having a unit variance Lapiacian density are shown 
in Table 3.6.1 below [5], 




•oo 

-3.0867 

-2.3796 

-1.6725 

-i.2527 

-0.8330 

-0.5332 

-0.2334 

0.0000 

0.2334 

0.5332 

0.8330 

1.2527 

1.6725 

2.3796 

3.0863 

+QO 



Decision and reconstruction levels for a 8 level non-uniform quantizer, assuming 
the signal, to be quantized, to have a unit variance Lapiacian density. 

Table 3.6.1 


Before this Quantizer can be used ^ the values in the table 3.6.i must be multiplied 
by the standard devations of the prediction error. The estimates of the standard 
deviation of the prediction error are formed using the relation 



These estimates for various test images are shown in Tables 3.6.2 and 3.6 .o. 



Image Name 

^DPCM 

'^lEAR-IC 

^lEAR 

i. Lisa 

10.03 

10.12 

10.03 

2 Lincoln 

7.85 

7.92 

7.85 

3. Stalib 

11.08 

10.12 

11.08 

Estimates for prediction 

for £ bits/pel 

Table 3.62 

error sienderd dei'ietions 

images 


Image Name 

^DPCM 

^lEAR 

1. Cafe 

24.03 

25.27 

2. Plath 

10.88 

13.38 

3. Eliot 

13.15 

14.98 

4. Sail 

11.37 

13.28 

5. Sat2 

13.06 

15.90 

6. CafeHQ 

33.27 

33.76 

7. PlathHQ 

21.86 

21.89 

8. EliotHQ 

21.20 

21.91 


Estimates for prediction error standard 
dei^iations for 6 bits/pel images 


Table 3.6.3 
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A final modification in the quantizer design procedure is required. The 
reason for this is that one code word is required to signal to the receiver the 
switch from the DPCM scheme to the lEAR/IEAR-IC scheme or the reverse transition. 
As such, only 7 out of a possible 8 levels are available for the quantization of 
the error. Hence the levels in Table 3.6 .i must be modified, by solving the design 
equations £51. In this work this modification is made intuitively. First, the range 
over which the levels of the 8-level quantizer will vary is determined by 
multiplying the values in Table 3.6.i by the maximum and minimum standard deviation 
values in tables 3.6.2 or 3.6.3. This is necessary because it is assumed that the 
same quantizer is being used for all the images. Bearing in mind this variation in 
the decision or reconstruction levels, and also the non-uniform nature of the 
quantizer the number of levels may be reduced by one in a heuristic manner. These 
quantizers for the quantizing the prediction error to 3 bits for 6 bits/pel and 8 
bits/ pel images are shown below: 

-63 -26 -25 -10 -9 -3 -2 2 3 9 10-25 26 63 

-34 -18 -6 0 6 18 34 

7-level prediction error quantizer for 6 bit/pel images 

-255 -50 -49 -22 -21 -6 -5 5 6 21 22 49 50 255 

-64 -35 -12 0 12 35 64 

7 level prediction error quantizer for 8 bit/pel images 

Once the quantization scheme has been finalized, the actual details of 
coding can be discussed. The procedures desribed in sections 3.3-3 5 yield the 
starting paints, the end points and the ft values for the IEAR(l)/IEAR-lu(i) regions 



by means of 4 bits and the following bit indicates if the following region is to be 
coded using DPCM or the lEAR process. The procedure is carried on till the end of 
the i-d block ,and is repeated for each i-d block. 

Note that the scheme for the non-adaptive coding of images using the 
IEAR(l)/IEAR-lC(i) process follows in a natural way. A suitable value of cn, is first 
chosen and each i-d block is coded using this value of a. Since the entire image is 
assumed to be the sample function of lEAR/IEAR-lC process with this a; no regions 
are coded using DPCM. Although this scheme has the disadvantage that a much 
higher number of NZE conditions^ than the adaptive scheme^ have to be coded using 
a longer** (4 bit' word; the overheads required to indicate the switch between the 
IEAR(i)/lEAR-IC(l) coding strategy and the DPCM are eliminated in this case. An 
additonal attraction of the the non-adaptive method is its minimal implementation 
complexity. 

This brings us to the end of the discussion on the sotware implementation 
of the algorithm for image coding based on the IEAR(i)/IEAR-IC<i) processes. In the 
next chapter, we shall compare the compression performance of the various 
configurations of the algorithm described here. 
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Chapter 4 


Results and Conclusions 


In this chapter, we compare the performance, in the coding of various real 
s imagery, of the various algorithms catalogued in Chapter 3. This evaluation is 
ieved by analysing the inter-relationships between the implementation 
iplexity, the compressions achieved and the distortion caused in the experiments 
:,he coding and the reconstruction of suitably chosen test images. 

The main result that we shall attempt to arrive at, at the end of this 
pter, is that all of the configurations of the algorithm for image coding, using 
IEAR(i)/IEAR-IC(i) processes, result in a coding performance which is always 
se than a scheme of variable length coding of the quantized prediction error 
ues obtained from a closely related DPCM scheme. The route to this result, 
lever, would yield valuable spinoffs in the form of insight into the right choice 
parameter values, such as the range over which a prediction error value may be 
nailed as equal to zero, without any significant deterioration in the image 
lity. 

The test images assume the PCM coded amplitude resolutions of 6 bits/pel 
8 bits/pel, respectively. The two 6 bits/pel images are of the head and 
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shoulder type, and are named LISA and LINCOLN. The 8 bits/pel images include an 
image o-F an outdoor scene, named CAFE, a head and shoulder image, named PLATH, 
and a satellite image, named SAT , The coding oF the histogram ecjualized versions 
of the 8 bit/pel images has also been investigated. 

Although appropriate remarks have been included regarding the visual 
appearance of an image, coded and reconstructed using either DPCM or a scheme 
based on the IEAR(i)/IEAR-IC(i) process, to establish an objective comparison 
between the algorithms their performance must be viewed quantitatively. The 
coding schemes based on the IEAR(l)/IEAR-IC(i) process suffer from the 
disadvantage that neither the number of bytes, nor the signal to quantization noise 
ratio (hereafter referred to as SNR) is known until the actual coding has been 
completed, in other words, neither of these two quantities can be fixed a priori. 
Therefore, both the number of bytes used in coding the image, and the SNR's 
obtained must be examined together, before an understanding of the relative 
capabilities of the various algorithms is acquired. Here, we first proceed to 
identify the best configurations among the adaptive schemes, explained in Chapter 
3, and then compare their performance with the non-adaptive method. 

4.1 Performance of adaptive algorithms 

The results for the adaptive coding of the images PLATH and CAFE, using a 
3 bit quantization code-word for the prediction coeffecient a, are shown in the 
Tables 4.i.i-4.i.4. Because the sizes of these images is larger than those for 6 
bits/pel images, this helps to amplify the differences between the various 
algorithms. In these tables crit. i, crit. 2, crit. i* and crit. 2 refer to the criteria 
used in isolating the IEAR(i)/IEAR-IC(i) regions (sec. 3.3) and the criteria used in 
signalling the ZE conditions (sec. 3.6), respectively. The condition t^ n(l) means 
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Comparison of experimental results for the image PL/^TH 
Table 4.1.1 
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Comparison of experimental results for the image PLATH 
Table 4.iJ2 
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Comparison of experimental results for the image C/iFE 
Table 4.1.4 
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that one 0+ the ineaualiltes mentioned in sec, 3 4 are used in determining if a 

ZE conditions has ocoured so that coding using the 

lEAP'IEAf^-i: '■ ©Bulls in a sa.'ing over DPCM, The condition implies the 

alte'-na-.e rase wnere a region is to be coded using an lEAR/IEAR-IC process if 
the nurr.ize’- cf *.£ ccnaitions in the region exceed an experimentally determined 
number The number of bytes required to code the image and the signal to 

auanliza .iun ncise ratios are indicated for each case for the values of a and c‘ 
t-.e comparisons between the various versions of the algorithms 
are difficult to make, for no version is distinctly the best one, these tables 
enable us to rr.8»e the following general observations about the relative 
performaroe of the va’^iants of the coding process: 


The T-level quantizer quantizes a certain range of values of the prediction 
errors m the nbd of zerc as zero. ( For 8 bit/pel PCM coded images, it was shown 
in sec 2 6 that the range [-5,5] is quantized as zero). As the interval over which a 
prediction error, if it lies in this interval, is signalled as a ZE condition (i.e. the 
value of c', if the coding is done using crit. i') is increased a fall in SNR, 
corresponding to a considerable deterioration in image quality, is detected. This 
IS easily seen in Table 4,i.l, where the increase of e' from 8 to 15, when the ZE 
conditions ir® signalled using crit. i', causes the SNR's to fall from about 33.3 dE 
tc lees than 3D dB, without an accompanying increase in the compression ratios. 
Again, when the regions are isolated in accordance with crit. 1, and ZE conditions 
are signalled using crit. 2', an increase of t' from 3 to 7 causes a similar drop in 
SNP The Quantizer quantizes the range [6,211 as 12 ( and the range [-21,-61 as -12 ) 
and the ixiore t’ erodes into this range, the greater is the degradation in image 
quality In Table 4 i 1 , as long as fc'i 10 (for the case when the ZE conditions are 
siganalled using 1' as the criterion), this decrease of SNR's is not observed. In the 
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with the use the condition t^nCl), and the !EAR(i) process with, the use of the 

ineoualitu in cases where crit. i' is used in sianalling the 2E conditions. 

From the above Discussion we conclude that the use of crit. 2 in identifying the 
image regions as the sample functions of lEAR(i) processes, with either crit. i' or 
crit. in signalling the 2E conditions, and the use of crit. 2 in isolating the lEAR- 
IC(i) regions, with the use of crit. i, yield a compression performance which is 
roughly equivalent. If crit. 1 is used, the lEAR(i) process and the lEAR-ICCi) 
process, with the use of crit. i’ in indicating the ZE conditions, yield rougly 
similar compression ratios which are better than those obtained through the use 
of crit. Z’. 


For B bits/pel images, when crit. 1' is used in coding, the schemes using 
either the lEAP i‘ or the lEAR-lC(i) processes, yield almost the same performance 
regardless of whether the regions have been isolated using crit. i or crit. 2. 
However, the lEAR(i) and the IEAR-IC<i) processes, with the use of the criteria i 
and i', for assigning the regions and coding the ZE conditions, respectively, yield 
slightly better results than the use of crit. 2 and crit. 2', for the coding scheme 
using the lEAR(l) process. For 6 bit/pel images, however, the schemes in which the 
regions have been isolated using crit. i perform better than those in which crit. 2 
has been used, for both the lEAR(i) as well as the lEAR-IC(i) processes. 


The histogram equalization of an image is achieved by replacing the gray 
level value L|^ by the value L'|^, for all pels with the gray level Lf^, with L'j^ given 


Lk= AW '-k 

l=si 


where nj, is the nuftiber of pels at L| and n is the total, nurnber of pels in the 


image. 

Although the relative performance of these algor ithms^ for, histogram equalized 
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images foIlD.'.s ine patte’-n outlined above, some interesting trends show up. 
Firstla i* 1 i£ used in identifying the regions, the compression ratios 

achieved .are Eigmficantiy poorer than those achieved in the case of non- 
hisiograrr. equalized images, also the compressions do not improve significanly as 
the value of t is increased. This shows that the recurrence of equal adjacent- 
pixel-ralio values, for these images, is infrequent. The use of crit. 2, however, 
suffers from no such limitation, and it is possible in this case to obtain higher 
compression ratios, with larger values for e', without any significant sacrifice of 
SNR's Howeve*-, the ratios obtained for these images are still worse than those 
obtained for non-histogram equalized ones 

Let us consider the case when the condition t^n^p^ is being used to 
determine if a sufficient number of ZE conditions have ocoured, before deciding to 
code a region using the lEAR/iEAR-IC process. It is interesting to consider the 
compression performances when the bound for t is in a nbd. of npp^., i.e. the 
condition being used is t^n, n being in a nbd. of n^p^,. Representative values are 
shown in Table 4.i.5. The value of n^p^^ makes its appearance with a decrese in the 
no. of bytes used in coding the image, and a simultaneous, but slight increase in 
the SNR, as n is increased from n^pj.-! to n^pj^. However, compressions in the nbd. 
of npp^ are close to what they are at nppj. (differing by s little as about O.i 
percent), so that the minimum of the compression ratios is a flat minimum. A 
consequence of this result is that the value of n for which the compression is 
mininimum has been observed to remain approximately constant (remaining in the 
range npp|^+i) for various images and also for different values of the parameters 
£ and fc'. 

The tables 4.1.1 to 4.1.4 give the results for 3-bit quantization of the 


predictor coeffecient a. The majority of « values, for most real-life, image:= lie in 



between 0.9 and 1.0£, for the lEAR(i) process, and between 0.9 and i for the lEAR- 
IC(i) process, and a 5-bit quantization of the entire range of possible a values, i.e 
[0,13, for the IEAR-!C(i) process, is therefore wasteful. An example is shown in 
Table 4.1.6 and a comparison with the values in Table 4.1.1 shows that the 
performance for the coding procedure using 5-bit quantization of a is found to be 
consistently worse than that for the lEAR-IC(l) scheme which quantizes the a-value 
to 3 bits. 

A few final remarks will help to clarify the above discussion and summarize 
the main results. We have seen that the use of crit. 1' in indicating the occurance 
of 2E conditions performs better than the use of crit, 2', paticularTy when the 
regions have been isolated in accordance with crit. 1. The use of crit. 1' is also 
to be preferred because here the value of a' (and also a) can be identified 
directly with the quantizer characteristic, and the possibility of inordinately low 
SNR's can be minimized. For example, for the 7-level quantizer characteristic for 8 
bits/pel images, given in sec. 3.6, the choice of a“^[10 causes no substantial 
depreciation in the SNR values, but a'=15 causes a fall of SNR by about 4 dB for 
the image PLATH. Similarly, for the quantizer for 6 bits./pel images a'<;4 yields 
SNR's comparable with 3 bit DPCM, whereas the choice of t'i? causes a drop in SNR 
values by 2-3 dB. However, this deterioration in SNR's is depends on the image 
being coded, and also on the specifics of the algorithm being used and the 
scanning methodology employed. The drops in SNR observed for PLATH in the row- 
wise scanning scheme are not observed in the case when the image is sub-divided 
into smaller blocks. 

In Table 4.i.7 are given the percentage compresion relative to 3-bit (7- 
level) DPCM and the falls in SNR's relative to the latter, for different images, 
using crit. i in isolating the image regions as sample functions of IEAR(1)/IEAR- 
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CAFE 

!EAR(i) 

lEAR(i) 

lEAR-iC(i) 

assignment using crit. 2 

assignment using crit. i 

assignment using crit. i 

coding using crit. 2' 

coding using crit. i' 

coding using crit. i' 

e=4, £. =7 

£=8, £=15 

£=S, £.'=15 

'^oot=S 

"oot=^ 

^opt=^ 

n no. of bytes SNR, dB 

n no. of bytes SNR, dB 

n no. of bytes SNR, dB 


5 

5394 

25.34 

6 

5389 

25.37 

7 

5407 

25.39 

8 

5428 

25.44 


5 

5290 

25.08 

6 

5267 

25.10 

r-j 

/ 

5292 

25.23 

e 

5315 

25 .25 


5 

5346 

25.10 

6 

5315 

25.10 

7 

5330 

25.10 

9 

5398 

25 .37 


Compresion performance with the bound on t, nj in the nbd. of 


Table 4.1.5 





7i 


PLATH 


Pcw-wi»e scanning 

coaing using cnt. i’ 

ass; ’ 

wsing crit 1 

assignment us 

ing crit. 2 

il n 1 '■ 

*'‘DOt 

t^ nQ) 

^opt 




'^ODt=5 



2-8 

2-8 

bytes 42?w 


4554 

4619 

Sf^ 33 21 

- 

33.42 

33.55 

t-t' 5. If 

« » 

4-15 

4-15 

bytes 4ie: 


4130 

4142 

Shf? 25 ef 


29.08 

28.98 

t“i' b-lf 




bytes 4062 




SNR 29 22 





Th® performance of the coding scheme based 
cr the IE,4F-IC<’1^ process using 5-bit quantization of a 


Tatole 4.1.6 


Row-wise scanning 



CAFE 



PLATH 


CAFEHO 




lEAR(l) 

lEAR-IC(l) 


lEAR-ICd) 

a b 

a 

b 

a 

b 

a b 

a 

b 

a 

b 

8.2 0 09 

8.3 

0 16 

31.0 

0.95 

31.8 1.12 

3.3 

0.03 

2.8 

0.0; 

9.B 0.50 

13 6 

1.58 

33.7 

2.13 

38,7 5.21 

7.9 

0.54 

4.0 

0 3! 

15.9 2.20 

15,6 

2 15 

37.9 

5.94 

36.6 5.57 






PLATHHQ 



LISA 



LINCOLN 


lEARfl) 

lEAR-IC(l) 

IEAR(, 

1) 

lEAR-IC(l) 

lEAR(l) 

lEAR-IC(i) 

a b 

a 

b 

a, 

b 

a b 

a 

b 

a 

b ; 

21.4 0.18 

18.5 

0.12 

2 9 

0.02 

4.2 -0.15 

10.9 

-0.15 

10,3 

-O.lS 

25.3 0.66 

24.1 

1.96 

11.7 

0.28 

11.9 -0.20 

25.1 

0,49 

23.2 

0.5i 

— 


— 

16,9 

0.32 

16.2 0.32 

25.1 

0.49 

23.2 

0.55 

Notation: 










a Percentage improvement in compression over DPCM 




b;Decrease in SNR over 

■ DPCM in dB 







Performance of the coding processes using the proocessj using 

and the lEAR-ICd) process, using f^nd), relative to DPCM. 


Table 4.1.7 














No. of pixels pixels 



Gray— level values 



Gray-level volues 
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IC<i) processes, and using crit. i' in indicating the 2E conditions. These values are 
close to the best values which can be obtained using the imclementation of the 
adaptive schemes considered here. The blockwise methods give similar results. The 
table shows that compressions from about iO to 25 percent are possible (for non- 
histogram equalized images' with almost no degradation in the picture quality The 
higher compressions in the case of PLATH are expected because it contains long 
runs of pixels, all of the value 255. For a more complicated image, like CAFE 
modest compressions are achieved, (See the histograms of these images on page 7S. 
The images themselves are displayed at the end of this chapter). The fall in SNR 
corresponds with a deterioration in image quality, which shows up as either 
horizontal streaks of equal intensity, in the case of row-wise scanning, or as 
blobs of equal pixel values, in the case of block-wise scanning. 

4.2 Performance of non-adaptive methods 

In the non-adaptive schemes a value of a is decided a priori, and then the 
entire image is coded using this a as the value of the prediction coeffecient, for 
a coding scheme based on either the lEAR(i) or the lEAR-IC(i) processes, using 
either crit. i' or crit. 2' to signal the ZE conditions. For example, for the lEAR(i) 
process based coding method using crit. i' to indicate the ZE conditons, whenever 
the magnitude of the prediction error is ^ e', a zero is transmitted; if, on the 
other hand, the magnitude exceeds e’, a one, followed by a 3 bit codeword for the 
quantized error is transmitted. Note that the switch between the DPCM and the 
lEAR/IEAR-IC coding strategies does not occur here. 

T o detef'mine if there is a single coding scheme which consistently gives' the 
best compression performance, we follow the following procedure: Since we know 
that the most common a-values lie in a nbd. of i.O, for a given image, from a series 
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of a-values in this nbd., the value of a which gives the best compresion 
performance for a particular configuration of an algorithm, is first determined. A 
comparison of the compaction capabilities between the various possible 
configurations, for these values of a which yield the best compactions, is next 
made and the variant which yields the best results is identified. The procedure is 
repeated for other images, and if the same version of the algorithm yields the 
most desirable performance for different images, then this is the uniformly best 
scheme , 

The performance of the different coding algorithms, for some 
representative values of a is shown in Tables 4.2.1-42.2, for different choices of 
a', for the images CAFE and PLATH, and for the row-wise scanning case. Here the 
prediction error is quantized to 7 levels. Note that here actually eight levels are 
available for the quantization of the prediction error. The case of 7 quantization 
levels is being considered to provide a comparison with the adaptive methods and 
the results for a 8-level quanitzer are taken up later. 

The tables 4.2.1 and 4.2.2 allow us to make the following statements about 
the relative performance of the coding schemes; 

When the lEAR(l) process based coding scheme is used, for a given value of 
e', the best cumulative compression and SNR peformance is obtained by using a 
value of a equal to 1.00, though in some sporadic cases the values 1.01 and 1.02 
yield marginally better performance. The advantage of an a-value of 1.00 is that 
the multiplications in the transmitter and the receiver can be done away with, and 
time for computation need no longer be a bottle neck in increasing the 
transmission rates. For the lEAR-IC(i) process based coding scheme, the value of a, 
using which the best performance, for a given t', is achieved, keeps varying from 



image to image and also on the specific configuration of the algorithm useo 
Typical a-valuesj for which good performance is attained, in case of di-^fe'^ert 
images, and different parameter values, are 0.96, C.98 and 0.99. However, the lEAR(l) 
process based coding scheme with a=i.00 and which uses crit. i' to indicate the ZE 
conditions performs considerably better than the three other schemes possible, 
viz. the lEAR(i) scherrie with, a=i.00 and crit. 2’ to indicate the ZE conditions, and 
the lEAR-IC(i) scheme, with a suitable a and using either crit. i! or crit. 2' to 
signal the ZE conditions. Exactly the same conclusion is arrived at by using 6 
bits/ pel images and by scanning the image block-wise. A comparison between the 
Tables 4.1.1 and 4.2.1 and the tables 4.1.3 and 4.2.2, respectively, demonstrates the 
superiority of the non-adaptive lEAR(l) based coding scheme, with ffi=1.00, over any 
of the adaptive schemes. The chief reasons behind this fact are-' 

1. The adaptive schemes have to endure the brunt of the overheads that are 
required to signal the switch between the lEAR/IEAR-IC coding strategies, but 
convey no information to the receiver about the image itself. 

2. The process of identifying the image regions as the sample functions of the 
lEAR/IEAR-IC processes, do so after ensuring that a saving over DPCM would be 
achieved on coding these regions using the lEAR/IEAR-IC based methods, but such an 
economy cannot be ensured in the presence of quantization errors. That is, the 
latter coding methods may actually turn out to be more wasteful than DPCM in the 
presence of quantization errors. 

3. An 8 level quantizer is available in the case of the non-adaptive methods, as 
opposed to a 7 level one for adaptive schemes, and this allows the non-adaptive 
schemes to attain higher SNR's. 

This means that the scheme which performs best, among the adaptive and the 
non-adaptive methods, is the lEAR(l) process based coding scheme with ffi=1.00 and 
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ii 
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20.49 
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3447 
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Performance of the lE/IRfi? based coding method, 
a=i.00 and using crit. 1' to signal the ZE conditions 


Table 4Z.3 
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Row-wise scanning 
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b 

3. 

b 

a 
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a 

b 

a 

b 

e'=5 
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0.16 

32.1 

1.20 
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0.35 

fc'=2 
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0.14 
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-0.34 
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16.0 
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42.6 
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0.78 
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Block-wise scanning 
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£'=4 

29.2 
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-0.47 
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28.1 

1.36 

53.5 4.38 

43.9 1.44 

£'=7 

46.7 

3.16 

47.1 

1.94 

Notation 










a Percentage improvement in compression over DPCM 
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Performance of the non-adaptiv'e IEAR<i? based coding method, 
a=1.00 and using crit. 1' to signal the ZE conditions^relatii/e to DPCM 


Table 42.4 
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the 8 bits/pel images the values of e' may be taken to be 5 and 8, respectively 
without undue loss in SNR. However, image areas with small changes in pel intensities 
are now replaced by constant intensity regions and this degradation is noticeable 
particularly in low intensity regions in the form of either streaks or blotches of 
constant gray levels. 

4.3 Conclusions 

This work has explored the possibilities of achieving high compressions, in 
the case of coding of images, by exploiting the structure of some models for non- 
gaussian time series. It was observed in sec. 2.il, that of an immensity of such 
models which have been taken up in literature, only the first order model, the lEARCi) 
process, can be used to attain these ends. A new process, the lEAR-lC(l) process, was 
constructed and its coding performance compared vis-a-vis the lEAR(i) process. 

Two schemes of image coding using schemes based on these methods were taken 
up: the adaptive and the non-adaptive one. For the scheme which changes the 
prediction coeffecients adaptively, it was found that the method using crit. i for 
identifying the sample functions IEAR(l)/IEAR-IC(i) processes and crit. 1' for 
signalling the occurance of the ZE conditions, almost always performs as good as, if 
not better than, any other adaptive scheme based on these processes . However, the 
arguments in section 4.2 reveal that the scheme which performs best, among all 
possible variants of the the adaptive and the non-adaptive methods, based on the 
lEAR(i) and the 1EAR-IC<1) processes, is the lEAR(i) process based coding scheme with 
a=ii.OQ and which makes use of crit. i' to signal the ZE conditions. The implementation 
complexity and the time for computation taken by this scheme is insignificant when 
compared with the complexity and the processing time for the adaptive schemes. For 
images consisting predominantly of large solid areas, for example, head and shoulder 
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gaussian time series. It was observed in sec 2 11, that of an immensity of su 
models which have been taken up in literature, only the first order model, the EAR 
process, can be used to attain these ends A new process, the lEAR-IC(l) process, w, 
constructed and its coding performance compared vis-a-vis the lEARfl,' process. 
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images with constant background, this method may give rise to compression ratios of 
almost 2 relative to DPCM. For images with a large amount of detail in them, small 
improvements over DPCH are achieved 

In this study, comparison of the IEAF.'..i>/IEAR-IC(i) process based schemes has 
been made with 3 bit DPCM For higher bit rate DPCM schemes these methods will 
achieve better percentge compression over DPCM, because the occurance of a ZE 
condition can still be indicated using one bit, and the fraction of saving made 
relative to DPCM signalling of this condition would improve. However, it must be 
remembered that the scheme is a variable length coding scheme, with all the 
concomitant problems, and for noisy channels it may become imperative to use a 
error control scheme in conjunction with this scheme, and this may negate the 
compressions achieved somewhat. It is suitable for image storage purposes where the: 
problems associated with the presence of noise are virtually non-existent. 

The lEAR(i) process based coding method, with a=i.00 is, under certain 
circumstances, identical. to a special case of a variable length coding scheme, which; 
can be used to code the quantized prediction error values obtained in the case of 
DPCM, with the predictor coeffecient «.'=i.00. To see this, consider the 7-level 
quantizer for 8 bits/pel images, given in sec. 3.6. This quantizer quantizes the; 
interval t-5,53 to zero. If, in the mentioned lEAR(i) process basesd scheme, e' is 
chosen to be 5, then this scheme is identical to a coding scheme which assigns! 
variable length code words, to the quantized prediction error obtained in the case | 
of DPCM, in the following way: it assigns a one bit word to the zero quantized 
prediction error and assigns a four bit word (one bit indicating that this level is ; 
not equal to zero and three bits giving the reconstruction value) to the other 
quantization values. The above argument holds for any quantizer having an odd 
number of reconstruction levels, i.e. which quantizes a certain interval to the zero 
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va*je £ i'-e.i:. auantiZer, choosing t greater than or equal to 8, imolicitly 

ccr .e'’*.= - .t fe* quantize'" to a t level quantizer and is again equivalent to the 

above variable length coding scheme, a choice of t' less than 8 actually converts the 
B le’ quantizer a c le'.-el quantizer and the above argument is again valid. Thus 


we see '.ra*. the schere i-.ith the best compression performance turns out to be simply 
a variable length coding scheme, which can be used to code the quantized prediction 
error values, obtained in the case of a DPCM scheme which makes use of the same 
quantizer as the lEAR(l) process based scheme. 


This study, thus disputes the claim that superior coding of images can be 
achie'-.'ed by the schemes based on the IEAP(i)/EAR-IC(l) processes, as in [i], and 
establishes that the scheme which performs best, within this family of schemes, is in 
no way different from a simple variable length coding scheme which can be used in 
conjunction with DPCM 


4.4 A note on the visual degradation of images encoded using the EAR(1)/IEAR-IC(1) 
process based schemes 

The images encoded using the procedure based on the IEAR(i)/IEAR-IC(l) 
process have edges which are not as sharp as those in the original image. This 
jaggedness or blurring of edges causes an overall decrease in sharpness in the 
appearance of the image Tnis phenomenon is similar to that occur ing due to the 
quantization errors in the case of DPCM but is aggravated in this case, particularly 
if t' IE chosen large 

Quantizing the range C-e',t’3 as zero has the effect of causing areas of equal 
intensity, with little variation, and gives rise to horizontal streaks if the image is 
scanned row-wise and blotches of constant intensity if it is scanned block-wise. Such 
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4.5 Suggestions for further work 

The results in this thesis would preclude much further work in applying the 
EAR process for image coding applications. However, a direct extension of this 
work would be to compare the above results, obtained in the EAR(i)/EAR:-IC(i) 
process based coding of images, with the Huffman coding of the quantized 
prediction error values. Huffman coding could be achieved after obtaining the 
estimates of the probability with which a particular reconstruction level is 
reached, duing the quantization process. Studies could also be undertaken on the 
rate distortion theory of these sources. 

The above pessimism, however, should not affect the general interest in the 
EAR process itself. For one, these processes were not constructed with coding 
applications in mind. The process and its variants are handy in modelling data 
which are known to be patently non-gaussian, like intervals and counts. Also, the 
process provides a way of generating non-gausian time series, without resorting to 
non-linear sytems. From, a theoretical and a practical viewpoint, this research 


would take the directions suggested by Ci53-C2Q]. 



References 


1. J. D. Gibson, "Data Compression of a First Order Intermittently Excited AR 
process" . 

2. A. K. Jain, "Image Data Compression-- A Rei/iew", Proc. IEEE, Vol. 69, March i98i, 

3. A. N. Netravali and J. □. Limb, "Picture coding- A Ret^iew", Proc. IEEE, Vol. 68, 
March 1980, 

4. H. G. Musmann, P. Pirsch and H. J. Gallerl, "Adi,-'-anaes in Picture Coding", Proc. 
IEEE, Vol. 73, April 1985. 

5. W. K. Pratt, "Digital Image Processing", 2nd Edition, Wiley Interscience. 

6. A. K. Jain, P. M. Farelle and V. R. Algagi, "Image Data Compression", in Digital 
Image Processing Techniques, Academic Press. 

7. W. F. Schreiber, " The measurements of 3rd order probabililty distributions of 
jy signals", IRE Trans. Inf. Th., IT-2, Sept. 1956. 

8. M. Kunt, A. Ikonomopoulos and Kocher, "Second Generation Image Coding 
Techniques", Proc. IEEE, Vol. 73, April 1985. 

9. Robert M. Gray, 'Vector Quantization", IEEE ASSP Magazine, April 1984. 

10. D. J. Granrath, "The role of Human yisual Models in Image Processing", Proc. 
IEEE, Vol. 69, May 1981. 

11. C. L. Mallows, "Linear Processes are nearly Gaussian", J. App. Prob., Vol. 4, 
1967. 

12. T. Berger, "Rate Distortion Theory", Prentice Hall, 1971. 

13. A. Papoulis, "Probability, Random Variables and Stochastic Processes", McGraw 


Hill, 1984. 



References 


i. J. D. Gibson, "Data Compr^ssinr^ ^ r- ^.r,. tx 

s-ion of a First Order Intermittently Excited 

process" . 


2. A. K. J3in, Data Co^raaaioa: ,1 Ra,.ia,v", Proc. EEE, Vcl, 59, 1981, 

3. A. N. Netravali and J. 0. Limh ^ a r, 

n>b, Picture coding: A Reinew", , 

March 1980 . 

4. H. G. Musmann, P. Pirsch and H T ■ r. 

a H. J. Gallert, ,4dt^ances in Picture Coding". Prc 

IEEE, Vol. 73, April 1985. ' 

5. W. K. Pratt, "Digital Image ProcesKina'' irw,i- .ri t , 

'-(-essing , Znd Edition, Wiley Interscience 

6. A K. Jain, P. M. Farelle and u r 

V. R, Algagi, "Image Data Compression", in Digit 

Image Processing Techniques, Academic Press 

7. W. F. Schreiber, " The measuremor^^■tr ^ ^ ^ ■ 

<^rements of 3rd order probabiUUy distributions i 

TV signals", IRE Trans. Inf. Th., it_ 2 ^ Sept 1956 

8. M. Kunt, A. Ikonomopoulos and ^ r. 

and Kocher, Second Generation Image Codu 

TechniQues", Proc. IEEE, Vol. 73, April 1985 > 

9 . Robarl M, Gray, 'Vector Scantia^tio..,", IEEE ASSP Magazine, Anril 1534. 

10. D. J. Granrath, "The role of m-j -i ■ t ^ 

Human yisual Models in Image Processing", Prdi 

IEEE, Vol. 69, May 1981. ^ 

11. C. L. Mallows, "Linear Processfur z' ■ ,v , 

messes are nearly daussian", J. ^ , 

1967. I 

I 

12. T. Berger, "Rate Distortion Theory", Prentice Hall, 1971. 

13 A Pacculia, "Prct,abil„y, RanCc™ l-ariablea and Stcchaatoc Procesaas”, McGra 


Hill, 1984. 


14. D. P. Gaver and P. A. W. Lewis, "First Order AR Gamma Sequences and Point 
Processes", Adv. App. Prob., Vol. 12, 1980. 

15. A. J. Lawrence and P. A. W. Lewis, "The Exponential ARM A, EARMA(p..q} process", 
J. Royal Stat. Soc., Vol. 42, 1982. 

16. A. J. Lawrence and P. A. W. Lewis. "A Nett' AR time series model in Exponential 
Variables", Adv. App. Prob. Vol. 13, 1981. 

17. A. J. Lawrence and P. A. W. Lewis,"/! Mixed Exponential time series model", 
Management Science, Vol. 28, 1982. 

18. Don. H. Johnson and P. Srinivas Rao, "Generation of Linear First Order time 
series" . 

19. Don. H. Johnson and P. Srinivas Rao, "Proprties and Generation of Non-gaussian 

• 

time series". 

20. Don. H. Johnson and P. Srinivas Rao, "A First Order AR model for Non-gaussain 
time series". 

21. J. Makhoul, "Linear Prediction ■- A Tutorial Review", Proc. IEEE, Vol. 63, April 
1975. 

22. A. K. Jain, "Adi/ances in Mathematical Models in Image Processing", Proc. IEEE, 


Vol. 69, May, 1981. 



Reconstructed images 

The images included in this section have been coded and reconstructed 
using the specific version of the algorithm indicated alongside each reproduced 
image. The number of bytes required to code the image and the SNR's obtined are 
also indicated. While considering the degradations caused in the images, the 
observations made in sec. 4.4 must be borne in mind. 


PLATH 




CAFE 




3. Coded using non-adaptive lEARd) based 
scheme, a=l.CKI, using crit. 1', £.'=15 
Row-wise scanning 
SNR=26.51dB, bytes=4a33 



4. Coded using ncn-adaptive IEAR<1) based 
scheme, a=i.OO, using crit. i', £=15 
Block-wise scanning 
Sh«=2458 dB, bytes=4417 





5. Coded using the adaptive EAR(1) based 
scheme, using cril. 2, crit. 2', ti^rKl) 
t=4, e'=i5 
Row-wise scanning 
SHR=25.59 dB, bytes=5i86 


6. Coded using the adaptive IEAR<i) based 
scheme, using crit. 2, crit. i', t2:n<l) 

£=4, £=15 
Row-wise scanning 
SNR=27.57 dB, byte5=5406 


7. Coded using the adaptive lEAR-IC(l) based 
scheme, using crit. 1, crit. 1', t;^n<l) 

£bs 5, fc^=8 

Row-wise scanning 
SNR=28.88 dB, bytes=:5634 



8. Coded using the adaptive EAR-IC(i) based 
scheme, using crit. i, crit. 1', t2:n<l) 

£=15, £'=15 
Row-wise scanning 
SNR=26.89 dB, bytes=5i88 


t 


BB-I ffo '-M ^ " f£A' 


